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authors first derive some formulas of Keplerian motion 
involving their six elements, then the perturbation equations, 
and finally, present the first order solution. It is inter- 
esting to observe that no critical angles occur in the second 
order solution, but that they will appear in a third order 
solution. 

The tenth paper by R. E. Wheeler of Hayes International 
Corporation presents a statistical procedure for estimating 
the -accuracy that can be expected of a given guidance func- 
tion. Variations due to changes in launch times, vehicle 
parameters, and other disturbances are considered. The 
procedure establishes an upper bound for 2-sigma limits and 
checks the validity of such limits. 

The eleventh paper by R. E. Wheeler of Hayes Inter- 
national Corporation presents the derivation of a mathematical 
model for fitting the steering function. No end conditions 
were considered since all constants of integration were 
combined with unknown constants in the expansion. 

The twelfth paper by Daniel E. Dupree, James O'Neil, 
and Edward Anders of Northeast Louisiana State College 
presents a method of developing a function 9 n+i(| 3’) 
previously derived in Progress Report No. 5. The method 

is detailed in the report and will be implemented here in 
the near future . 
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DIRAC'S GENERALIZED HAMILTONIAN DYNAMICS 
AND THE PONTRYAGIN PRINCIPLE 


by 

Mary Payne 


SUMMARY 



Dirac's generalized Hamiltonian dynamics is described and applied first 
to a particular optimization problem and then to a general class of such problems. 
It is shown that the Dirac formulation leads to a Hamiltonian to which the 
Pontryagin Maximum Principle can be applied. Further, this Hamiltonian has 
the property of being canonical in all of its variables, and is thus susceptible 
to treatment by the methods of classical celestial mechanics. The report 
closes with a brief discussion of how perturbation techniques, based on the 
Dirac Hamiltonian, might be developed for the solution of optimization problems. 
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I. INTRODUCTION 


The purpose of this report is to formulate a generalization of the 
Pontryagin approach for application to optimization problems. This generali- 
zation will add nothing new to the basic equations to be solved, but is, rather, 
intended to lead to perturbation procedures for the solution of these equations. 

In the Pontryagin formulation of optimization problems a function which bears 
close resemblance to a Hamiltonian function is introduced. It differs from 
most classical Hamiltonian functions in two respects: First, the classical 
Hamiltonian for most problems in dynamics is quadratic in the momenta 
whereas the Pontryagin Hamiltonian is linear. The second difference is that 
the Pontryagin Hamiltonian is canonical only in the state variables and their 
conjugate momenta. In the Pontryagin approach, the control variables are 
determined, not from Hamilton equations, but by the Pontryagin maximum prin- 
ciple which says that the Hamiltonian must be a maximum in the control variables. 
The generalization consists in defining a new Hamiltonian, to which the maximum 
principle can still be applied, but which is canonical in all the variables. The 
advantage of this new Hamiltonian is that all the methods of classical dynamics 
now become available for the solution of the problem. In particular, the classi- 
cal perturbation theories can be applied for obtaining successive closed form 
approximations for the solution. Most current efforts to solve optimization 
problems involve numerical integration with the serious defect that initial values 
of the momenta must be found from an initial set of trial values by some differ- 
ential correction procedure whose success will in general depend on how close 
these trial values are to the actual initial conditions. 

The construction of the new Hamiltonian is based on a technique developed' 
by Dirac for problems in which the Lagrangian function is linear in the velocities. 
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It is shown in Section m that the construction of a Hamiltonian for such problems 
involves special difficulties that are not present in the usual problems of 
classical dynamics for which the Lagrangian is quadratic in the velocities. 

Dirac's motivation for this work was his interest in relativistic gravitational 
fields and quantum electrodynamics. In both problems the Lagrangian is 
linear in some of the generalized velocities, so that the difficulties that are 
involved in the construction of a Hamiltonian are identical with those involved 
in optimization problems. Thus the Dirac formulation, although not originally 
intended for this purpose, can be applied to optimization problems. 

It will be seen that the new Hamiltonian, which will be referred to as the 
Dirac Hamiltonian, will be linear in all the momenta problems for optimization. 
This fact makes it very attractive from the point of view of development of a 
Hamilton- Jacobi perturbation theory since the Hamilton-Jacobi equation will be 
a linear partial differential equation of first order. 

Section II presents some general background material. In Section HI, 
the construction of the Dirac Hamiltonian is discussed in some detail. Section 
IV presents a development of the Dirac Hamiltonian for a time optimal point-to- 
point transfer problem. In Section V the connection between the Pontryagin and 
Dirac Hamiltonians is discussed for the example of Section IV, and in Section V 
the theory is extended to more general problems. Finally, Section VII presents 
a brief discussion of the ways in which perturbation procedures might be developed 
for the solution of optimization problems. 

H. BACKGROUND 

In the Pontryagin formulation of optimization problems , the variables are 
classified as state variables x. which must satisfy certain equations of motion 
and control variables y. which appear in the equations of motion: 

k. = f. (x,y) , i = l,2,...n. (1) 
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From the state variables x. and a set of adjoint or conjugate variables 0. a 
Hamiltonian function Hp is constructed which is canonical in the variables 
x. and their conjugate momenta 0 ., That is, the Hamilton equations 

9H p . SH p 

^i 3 0. ’ ^i dx. ’ 

l 


( 2 ) 


are satisfied. The Hamiltonian is constructed so that the Hamilton equations 
for x. are just the equations of motion, and the equations for 0 . serve to define 
the conjugate functions 0 .. The Hamiltonian Hp is not canonical in the control 
variables y. since no momenta conjugate to the y. appear and hence the y. are 
not given by partials of Hp with respect to their momenta. The subscript P is 
used to distinguish the Pontryagin Hamiltonian from a conventional Hamiltonian 
which is canonical in all of its variables. 


For a problem which optimizes x q with 

x Q = f 0 (x,y), (3) 

an additional variable 0 Q is introduced and the Pontryagin Hamiltonian has the 
form 

n 

H P = I ^i f i (x ’ y) ‘ (4) 

i=0 


For a time optimal problem f Q = 1, and it is shown (page 20 of Ref. 1) that 0 q 
is a negative constant, which may be taken as -1 without loss of generality. 

As mentioned above, the Pontryagin Hamiltonian is not canonical in the 
control variables. The control variables are determined from the Maximum 
Principle which says that H p must be a maximum in the control variables if the 
optimization is a minimization. It is shown in this report that a technique 
developed by Dirac may be used to define a Hamiltonian which is canonical 
in all of the variables. This Hamiltonian is usable as a Pontryagin Hamiltonian 
for application of the Maximum Principle and has the added advantage that the 
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transformation theory of Hamiltonian dynamics is now available for the solution 
of optimization problems. It is evident from Eq. (4) that H p is linear in the 
momenta and this property will also hold for the Dirac Hamiltonian H p> 
which in fact is linear in all the momenta p. conjugate to the coordinates q. , 
which will be seen to include not only the state and control variables, but also 
the Lagrange multipliers associated with the Lagrangian formulation of the 
problem. Thus, for example, the Hamilton- Jacobi equation obtained by substi- 
tuting 

SS(q,a) 

P i = dq. 


in H D will be a linear partial differential equation for the generating function S. 
Its solution would lead to a canonical transformation, defined by S, to new 
canonical variables a. and Q. obtained from Eq. (5) and the following equation: 


V 


9S(q,a) 

9a. 


( 6 ) 


The Hamiltonian may be written 


h d = H D <“i> 

in terms of the new variables , so that 


(7) 


S H 


D 


“i 9Q, 


= 0 


SH 


Q. = . 
i 9 a. 


D 


= !/'.= constant 


(8) 


or 


Q i = v i t+ 


(9) 


Even if the Hamilton- Jacobi equation is not solvable, the standard perturbation 
procedures of celestial mechanics would now be available by writing as the 
sum of H dq and H D1 with H^q selected to represent a solvable problem and H D1 
treated as a perturbation (Ref. 2, pp. 62-74). 
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In order to obtain the Dirac Hamiltonian H^, it is necessary to start from 
a Lagrangian formulation. For a time optimal problem the Lagrangian function 
is 


L = 



( 10 ) 


where the X. are the usual Lagrange multipliers associated with the equations 
of motion regarded as differential constraints. To pass from a Lagrangian to 
a Hamiltonian formulation, one first defines momenta p. conjugate to the 
variables q. (which include the x^, y. and X.) by the equation 


Pi = 


S_L 

% 


(11) 


For the Lagrangian (10), the momenta conjugate to x., y i and X. are 


P . = 

xi 


a l 
ax. 


= x . 


P Xi = 


a l 

a^i 


= o 


p . = 


a l 


yi ^ 


= 0 


( 12 ) 


The Hamiltonian is conventionally defined as the function 


H = ^ p. ~ L . (13) 

i=l 

It is readily shown that this Hamiltonian is a function only of the q's and p's and 
is independent of the ti's. This is done by considering the variation in H produced 
by variations in the q's, 4's and p's consistent with the defining relations (11) 
for the p's, but otherwise arbitrary: 

n n n ^ L “ 9 L 

6H= }' P i 6^. + ^ ^sp.- 6%- 64i 

i=l i=l i=l i=l (14) 

= E ^ i 6 Pi-||r 6 ^ 

i=l i=l ^ 
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The variation in H is independent of the variations in the fy's and hence it must 
be possible to write H in such a way that it depends only on q's and p's. In 
conventional problems in dynamics , Eqs. (11), defining the p's, may be uniquely 
inverted to give the fj's as functions of the q's and p's. These expressions for 
the tj's may then be substituted for H in the defining Eq. (13) to give a unique 
expression for H as a function of q's and p's. 

For the optimization problem, with the Lagrangian (10), the relations of 
Eq. (14) still hold, so that the Hamiltonian is still independent of the Cl's. It is, 
however, no longer unique, as may be seen by direct use of Eqs. (12) and (13): 

n n n n 7 n 



n 

= ) X. f . - 1 , n 7 = number of control variables 

Lj 1 1 

i=l 

since the first three sums vanish by virtue of Eqs. (12). One can again make 
use of Eqs. (12) to write the Hamiltonian as 

n 

H> p xi f i ~ 1 < 16 > 

i=l 

which has, of course, the same "value" as H, but has a different functional form. 
The form (15) would require that all velocities vanish if it is considered as a 
"true" Hamiltonian, canonical in its variables. The form (16) is substantially 
the Pontryagin Hamiltonian and is canonical in the state variables. 
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m. THE DIRAC HAMILTONIAN 


In References 3 and 4, Dirac has developed his Hamiltonian formulation 
for problems in which constraints among the coordinates and momenta are 
implied by the defining equations for the momenta. The treatment in Reference 
3 is more detailed and also more difficult to read than that in Reference 4. 

Most of the development in Reference 3 is for a Lagrangian homogeneous of the 
first degree in the velocities. While this restriction involves no loss of general- 
ity (the Lagrangian may always be transformed to this form, as shown in Refer- 
ence 5), it does not appear in Reference 4. The results of the two analyses are 
substantially the same. The treatment in Reference 4 is in a form more useful 
for optimization problems. The contents of References 3 and 4 are presented 
below, for direct application to optimization problems. 

The starting point for Dirac's development is a Lagrangian which is a 
function of N generalized coordinates q. and their velocities fj. : 

L = L(q,fj ) (17) 

from which momenta p^ conjugate to the coordinates q. are defined by 

_ S L 

p i “ a ti i (18) 

As noted in Section H, if Eqs. (18) may be inverted to give each tj. as a unique 
function of the q's and p's, the classical Hamiltonian development follows. If 
this is not the case, the classical definition of the Hamiltonian becomes ambiguous, 
as illustrated by Eqs. (15) and (16). Actually these two equations are special 
cases of an infinite number of forms for the Hamiltonian: 

H l 4 LVm (19) 

m 


16 


where is any form such as in Eqs. (15) or (16), the are arbitrary functions 
of the q's and p’s, and the cp's represent the constraints among the q’s and p's 
implicit in Eq. (18) defining the p's: 

<p m <q,P) = 0 (20) 

These constraints may arise because some of the q's do not appear in Eqs. (18) 
or because of redundancy of these equations in the q's. Strictly speaking, the 
expressions (19) cannot really all be regarded as Hamiltonians since by a 
Hamiltonian one usually means a function of coordinates and their conjugate 
momenta such that the Hamilton equations 




( 21 ) 


are equivalent to the equations of motion of the system described by the 
Lagrangian L. Thus, the question that Dirac asks is "How may coefficients 
u m c h° sen from all arbitrary coefficients a m in Eq. (19) so that, given some 
H^ satisfying 


H 1 = X Pi ^i - L (22) 

i 

the function 

H = H. + ) u <p (23) 

1 Z_i m v m 

m 

is the Hamiltonian for the Lagrangian system L?" As shown in Section H, the 
function H^, defined by Eq. (22), may be regarded as a function only of q's and 
p's. Since the(p m are also functions only of q's and p's, the function H of Eq. 

(23) satisfies the first condition for a Hamiltonian, i. e. , it is a function only of 
coordinates q. and their conjugate momenta p.. It remains to determine the u m 
as functions of the q . and p. such that the Hamilton equations describe the motion 
of the system. It will turn out that the Hamiltonian so obtained is not unique. The 
essential reason for this is discussed at the end of this section. 
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It is necessary to make a few remarks about the functions before 
proceeding. These functions are assumed to form a complete, independent set 
of constraints on the q's and p's implied by Eqs. (18). The term "independent" 
means that no constraint, say cp^ is implied by the remaining constraints. In 
this connection, it should be noted that independent constraints cp^ = 0 and 
independent functions <p^ are not synonomous terms. The functions q and q 1 
are independent but the constraints q = 0 and q^ = 0 are not independent; each 
implies the other. The term "complete" means that every constraint implied by 
Eqs. (18) is also implied by Eqs. (20) and conversely. It is obvious that the 
number of constraints M cannot exceed the number of coordinates N. If the 
Lagrangian is independent of some velocity, say it follows that the momentum 
p^ conjugate to vanishes so that one constraint would be 

<Pl = P k = 0 ( 24 ) 

If the Lagrangian is homogeneous of the first degree in the velocities, the 
momenta will be homogeneous of degree zero in the velocities and hence depend 
only on the ratio of the velocities. Since there are only N-l independent ratios 
of velocities and there are N p's, at least one constraint among the q's and p's 
must exist. Still another way in which constraints might arise occurs when the 
velocities and appear, for example, only in the form ^ + i^. Then 

p l = p 2 = + H 2 > (25) 

and the corresponding constraint is 

<P = p l - P 2 = 0 (26) 

In the following development the assumptions made on the nature of the 
constraints is that they be independent, complete, and differentiable. The pur- 
pose of this last condition will appear immediately. 

It has already been seen (Section II) that the variation in H^, induced by 
variations in the q's, fj's and p's consistent with the defining equations for the 
momenta, may be written 
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(27) 


6H i = lVPi-[ir 6<! i 


The condition on the variations in the q's, tj's and p's implies not only that Eq. 
(18) holds (this was used to cancel out the 6 4 terms) but that they be such that 
the induced variations in thetp's shall vanish — that is that the constraints not be 
violated. Thus, the following relations among the 6 q. and 6p. hold: 


6 < Pm~ 


' r~i dcp 

Lb q. 6q i L dp. 


6p. = 0 


These equations may be interpreted as saying that of the 2 N variations, 6 q. 
and 6p., some M may be determined in terms of the remaining 2 N - M. At 
this point the meaning of the independence of the <p's may be more precisely 
stated: the (p's must be such that Eq. (28) form a consistent independent set of 


linear equations in the 6q. and 6p.. 


Recalling that is a function only of the q's and p's, and using the condi- 
tion of differentiability on L which implies differentiability of with respect to 
its variables, one may write the variation of in the form 


d H 




- d H, 


-5— *■ 6p. 

Sp. F i 


If there were no constraints the 6q. and 6p. could all be regarded as independent 
and matching coefficients of the 6q. and 6p. in Eqs. (27) and (29) would lead to 
the usual Hamilton equations. With constraints present, one may proceed as 
follows: Multiply Eq. (28) by the undetermined multiplier (-uj and sum over 
m, add Eq. (27) and subtract Eq. (29) to obtain 


L h iiL 


m dp. 
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Now think of some M of the 6 and 6p^ as being determined in terms of the 
remaining 2N - M by Eq. (28) and require that the be such that the coeffi- 
cients of these M variations vanish. The remaining (2N - M) 6 q . and 6p i may 
now be regarded as independent, so that their coefficients must also vanish. 
Thus all coefficients in Eq. (30) are to vanish and, making use of the Lagrange 
equations 


. = _d_ 3L B_L 
P i dt 3qT dq. 


(31) 


one obtains 


1 . \ S(5 m 


SH l V 

^i Bp. + h U m Bp 
m 


dH- 


S<p 


u 


m 


m dq. 


m 


(32) 


Since theo all vanish, it follows that for any variable x 

'1X1 


Scp T 


Su 


Sep. 


S x U m U m S x + S m u m S x 


m 


(33) 


and hence, defining the Dirac Hamiltonian 


H D ~ H 1 + Zj u m ^m 


(34) 


one may conclude that is a Hamiltonian with Hamilton equations: 


B H 


**i Bp 


D 


BH 


D 


3q. 


(35) 


The coefficients u may be determined as functions of the q's and p's as 
m 

follows. The equations of motion (35) obtained from the Hamiltonian (30) must 
be consistent with the constraints (20). This means that not only must the <p m 
vanish, but so must their time derivatives. That is, for each m 
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( 36 ) 


<P 


m 


y£»4. + y^ 

L 9 q. l L 


9p. P i 


V 5( *W aH l , y S< Pm ^ 

L 9q. V dp. L U m 9p. s 
i m 

r^fWSHj. y 

^9Pj vgqj L m d ' 

i m 

It generally happens that no u's will appear in some of Eqs. (36). In this case, 
additional constraints among the q's and p's appear, whose time derivatives 
must also vanish. Those constraints associated with the defining equations for 
the p's are denoted by <p m and are called primary constraints. All other con- 
straints are denoted by Xj and are called secondary constraints. Only the 
primary constraints appear in the Hamiltonian. All constraints must have 
vanishing time derivatives, so that secondary constraints arising from <p m = 0 
may lead to additional secondary constraints. This process of equating time 
derivatives of constraints to zero must be repeated until no further secondary 
constraints appear. There will then remain a number of equations for the u m 
which may be insufficient to determine all M of the u . The case in which the 
remaining equations are insufficient to determine all of the u m requires special 
discussion. Any inconsistency in either the constraining equations or the 
equations for the u m indicates an original Lagrangian formulation containing 
inconsistencies. 

To see how this process works in detail, it is desirable to introduce the 
Poisson Bracket notation. If £ and rj are two dynamical variables (functions of 
q's and p's) their Poisson Bracket (P. B. ) is defined by 
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from which it follows immediately that 


u , t }] = -[ tj ,£] zt,a-o 

3 = [£#»?] + < 38 > 
lt,vC\ = r)li>£ 3 +CU.»?3 

The Usefulness of this notation lies in the following relation : 




Y "S£_ 5H P 5£ 5H P 

Y Laq. Sp. Sq. 9q. 




(39) 


where use has been made of the Hamilton Eqs. (35). Recalling the definition 
(34) for HpOne obtains 


«=[{• H l + Z Vm] 


m 


£> Hp + ^ u m L £» + ^m L U m _ 


m 


m 


(40) 




Z u m _ ^ ’ ^m . 


on making use of Eqs. (38). 

The condition that a primary constraint have vanishing time derivatives 
may now be written 


22 


( 41 ) 



It may happen that for some cp^, Ctp^, <p m ] vanishes for all m, and in this case 

[<Pk» H i] = 0 = *1 ( 

would appear as a secondary constraint. Secondary constraints could also 
arise by elimination of u’s among some of Eqs. (41). Let the independent 
secondary constraints obtained from Eq. (41) be denoted by Xj. It is now re- 
quired that all X- should vanish; that is 

V[*l' H l] + Z u n.[M , m] < 

m 


and Eqs. (43) may lead to further secondary constraints. When all the secondary 
constraints have been found, there will remain a number of independent linear 
equations in the u^. 


It is now necessary to provide a further classification of the constraints. 
A constraint is defined as first class if its P. B. with and with every other 
constraint vanishes either identically or by virtue of the constraints. All other 
constraints are second class. Suppose that a set of the primary constraints, 
denoted by is first class. It follows that 


<Pk» H il ^m^ [^' X i„ 0 


(44) 


Thus 


<Pk = [<Pk’ H l] + I u mtw] =0 


m 


$m' “[ < Pm # * H l] + I vfon'**: 

m/k 


m 


= 0 


(45) 

(cont’d on next page) 
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(45) (cont'd) 



H 


1J 


>: 

m^k 


u 

m 


[Xi-'Pm ]' 0 


and none of the equations requiring time derivatives of the constraints to vanish 
contain the u^. Therefore, the u^ are undetermined and the first class constraints 
appear in the Hamiltonian H with undetermined multipliers. Dirac shows in his 
paper that the multipliers associated with the second class primary constraints 
are uniquely determined by those Eqs. (45) corresponding to the second class 
constraints. The equations corresponding to the first class constraints, whether 
primary or secondary, yield no information of the u's. 


The Dirac Hamiltonian, given by Eq. (34), is now determined in terms of 
any H.^ consistent with Eq. (22) and the u's determined from Eq. (45). The u's 
so obtained will, of course, depend on the particular form selected for H^. The 
Dirac Hamiltonians obtained from different choices for may appear, at first 
glance, to have different forms. This brings up the question, noted at the be- 
ginning of this section, of the ambiguity in the Dirac Hamiltonian. It is, of 
course, immediately obvious that first class primary constraints introduce an 
ambiguity since their u coefficients are undetermined. There is a further ambiguity 
which arises from the fact that the Hamiltonian has been constructed to be a 
function only of q's and p's. Further, the Hamilton equations are satisfied and 
are such that all constraints are maintained. The validity of the Hamilton 
equations was obtained from the first order variation of H^ and the (p's. Now, 
suppose that some function g (q,p) is such that its first order variation 



2 

vanishes by virtue of the constraints. Such a function is <p^ or cos (p^ : 
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( 47 ) 


6 (WO = 2 <Pk 6(p k = 0 since <Pk = 0 

6 (cos <p^) = ~ (sin 6^ = 0 since sin <p^ = 0 


Since any such function may be added to the Dirac Hamiltonian without changing 
either the Hamilton equations or the validity of the constraints, an additional 
ambiguity is introduced besides that inherent in the existence of first class 
constraints. The Dirac Hamiltonians obtained from different choices of H^ 
all lead to the same final equations of motion and all maintain the same con- 
straints. Hence, they must differ only by functions whose first order variation 
vanishes. 


The introduction into H^ of additional terms whose first order variation 
vanishes has a very practical application: it frequently makes possible the elimina- 
tion of some of the variables from the Hamiltonian, and reduces the number of 
equations which must be solved. Just how this works is illustrated in the time 
optimal orbit transfer problem discussed in Section IV. 


IV. THE DIRAC FORMULATION 
FOR A TIME OPTIMAL TRANSFER PROBLEM 


This section illustrates how the Dirac Hamiltonian formulation is applied 
to optimization problems for the following time optimal transfer problem. For 
simplicity, the two dimensional problem is chosen. The state variables are the 
coordinates x and y, their time rates of change £ and rj, and the mass, m. It is 
assumed that initial and final values of all state variables are specified. The 
control variables are 0, the direction of thrust, and the rate of fuel flow which 
is assumed bounded between zero and some fixed upper limit /3. Thus, the 
equations of motion for the problem are: 

* = £ i = - iX.+ S&S2S ia-Btae 

s s dx m 


._ a _ 3 V . eg cos a o 

y - 77 V — ^ h — fci cos 0 

J 1 ' 3y m 

2 

m = - /3 cos a 


(48) 
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where the thrust is, of course, -cm, and the constraint on the fuel flow is 
carried by the variable a . Forces other than thrust acting on the vehicle are 
assumed derivable from a potential function V(x,y) dependent only on position 
of the vehicle. The transfer time is to be minimized, subject to the equations 
of motion (48), which are to be regarded as differential constraints. Since the 
Dirac formulation can give information only on first order variations in the time 
integral of the Lagrangian, no information on the nature of the extremals for 
this integral appears in this section, hi the next section the Maximum Principle 
is incorporated in the theory, and discussions of the nature of the solution ob- 
tained in this section are thus deferred. Introducing Lagrange multipliers, the 
Lagrangian for this optimization problem is 


L = 1 + X x (k - £) + X 2 (ir - rt) 


+ 



+ 


2 

5 V c 8 cos a 
3x m 


sin 



o 

+ ^ (rfi + 0 cos a ) 


(49) 


The Lagrangian L contains, explicitly, the differential constraints and the bound- 
ing constraints on m. It does not, however, contain the constraints on the initial 
and final values of the state variables. This omission means that the constants 
of integration from the Hamilton equations must be ultimately used to determine 
initial values for the control variables and the Lagrange multipliers. It will be 
seen later that this represents a serious defect in the theory, and that an effort 
should be made to find a Lagrangian formulation which explicitly includes all 
constraints on the problem to be solved. 

In the Lagrangian (49) the state variables x,y,£,rj and m, the control 
variables 0 and a, and the Lagrange multipliers X. will all be regarded as 
coordinates. The only velocities appearing are those corresponding to the 
state variables. The momenta conjugate to the coordinates are obtained by 
differentiation of the Lagrangian with respect to the corresponding velocities: 
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No velocities appear in the defining equations for the momenta and thus all of 
these equations represent primary constraints. Further, all of the constraints 
are independent. The constraints are labeled as follows: 




H 1 

M 

II 

•d 

CD 

11 

O 

V 2 = Py - X 2 = ° 

<*l~\ =0 

' 0 12 = p a =0 

<*>3 = p ( ' X 3 = 0 

<P8 = PX 3 = 0 


^ V X 4 =0 



«>5 = p nT X 5 = 0 

^10 = Px =° 

5 



The function is selected to be 

2 

tt \ i. . \ „ \ f dV ciScos a . /n«x 

H 1 “ X l5 tX 2"- A 3VS d Sm6 J (52) 


2 

\ c B cos a „ _ 2 

- X. I -r y cos 0 J - X c fl cos a- 1 

4Vdy m 5 ^ 

which is consistent with Eq. (22). 

To obtain the expressions for the<p's, it is necessary to obtain the P. B. 's 
of the<p's among themselves and of eachcp with H^. The P. B. 's of the (p's 
among themselves are 
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(53) 


[<p r <p 6 l= - C<p 6 , <pj] * " 1 
[<p 2 > <p 7 1 = - C<p 7 » <p 2 ^ = ” 1 

[<P 3 , <Pg] = - [<pg, CPg] = -1 a11 ° ther CfPi* = 0 

[<p 4 . <p 9 3 = - £<p 9 > <p 4 3 = - 1 
t<p5» <Pio^ = “ ^ ^10* ^5^ ~ ~ 1 


and the P.B's of the<p’s with H 4 are 


r „ 1 _ ^ a v . . 3 V 
Ltpj, Hjj a 3 2 a 4 bx'dy 

0 X 

Ccp 6 , H 4 ] = -fr 

S ^ v s 

[<p 2 , *- 3 3x3y + X 4 2 

[<P 7 , %] = -T) 

C<Pg» 


CCP 4 , H^] X 2 


. 2 

[<p 5 , — (X 3 sm0 + X 4 cos0) 

m 

r , 2 

[<PlO»HiJ = j 3 cos a 


L^ r H x ] - - £ ^ 2 - s2q? (Xg cos 6 - X 4 sin 0) 

[<p 12 , Hj] = j8 sin 2 a {—• (X 3 sin 0 + X 4 cos 0 ) - X & } (54) 

The time derivatives of the<p's are obtained by making use of Eq. (40) and they 
must be equated to zero: 


« 


4 
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• - ^ n i_v . -v a v _ n 

tPl - u 6 + X 3 Sx 2 X 4dx9y 0 

. A % afv . . a 2 v _ n 

^2 _U 7 + X 3Sxay X 4^ y 2 0 

^3 = - U 8 ' X 1 = ° 

( P4 = - U 9- X 2 = 0 


<p = - u 1Q + ^ - ^° — - a (X„ sin 0 + X 4 cos 0) = 0 
in 


* 6 = “i*«=® 

<p 7 = u 2 - ^ = ° 


^8" U 3 + dx 


S V c g cos g 


m 


sin 0 = 0 


V«4 + ^ £ *S f -““»- # 


(PlO = U 5 + 8 COS o; = 0 


CPU = - c ^ os2 « (X 3 cos 0 - X 4 sin 0) = 0 
<£>12 = £ sin 2 a (Xg sin 0 + X 4 cos 0) - X 5 } = 0 


(55) 


It will be noted that the first ten <p's give immediately the first ten u’s. No 
u's occur in the last two and hence the requirement that<pn undp^ vanish 
leads to two secondary constraints: 
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( 56 ) 


X 4 = cos a (Ag cos 0 - X 4 sin0) = 0 

X 2 = sin2 a (Xg sin 0 + X 4 cos 0) - X 5 } = 0 

where the factors c, 0 and m, known to be nonvanishing, have been omitted. 
These secondary constraints are, in a way, somewhat embarrassing since 
they both appear as products, so that further discussion requires consideration 
of the various combinations in which the factors may vanish. The occurrence 
of this problem is, however, not surprising; it is just the way in which the 
"switching function" in the conventional theory would first appear. To complete- 
ly specify the "switching function” requires consideration of second variations 
to dis tin guish minima from other stationary values of the time integral of the 
Lagrangian. There is no provision for this in the Dirac theory, and further 
discussion of this point will be deferred. First, the Dirac Hamiltonian is 
obtained and in the next section the way in which the Maximum Principle 
complements the Dirac theory is discussed. 

The ways in which the vanishing of the x's may be guaranteed are: 


Case 1. 

COS O! = 0 

Case 2. 

sin a = 0, Xg cos 0 - X 4 sin 0 = 0 

Case 3. 

x 3 = x 4 = x 5 = ° 

Case 4. 

Xg = X 4 = sin a = 0 

Case 5. 

Xg cos 0 - X 4 sin 0 = 0, (Xg sin 0 +X 4 cos 0) -X g = 0 


For a complete analysis of this time optimization problem, each of these 
possibilities should be examined in detail with recognition of the fact that the 
nature of the problem may require the use of different Hamiltonians for differ- 
ent portions of the final optimum trajectory. Since, however, the purpose in 
this report is merely to illustrate the application of the Dirac technique to op- 
timization problems, only the first two possibilities are discussed. These 
correspond to the conventional solution of the problem by the Pontryagin 
principle. It might be mentioned that the occurrence of possibilities 3,4, and 5 
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C\ a 2 v . x afv.Vn C\ a 2 v^ 

P X1V X 3 3x 2 X 4 3x5y^ P X2V A 3dxdy A 4 gy 2y 


“ X 1 P A3 - X 2 P X4 + ^2 — < X 3 Sin 6 + X 4 COS 0 > P X5 


- X ^ cos 0 + Xg sin 0 
Xg sin 0 + X 4 cos 0 P 0 


The form of this Hamiltonian differs from that of case 1 only in the pg term. 


It will be recalled that it was stated in Section IV that the Dirac Hamiltonian 
is not unique and that terms whose first variation vanishes identically may be 
added at will. One way in which differing Dirac Hamiltonians could be obtained 
would be to start with the X's in H^ replaced by the momenta conjugate to the 
state variables, which is consistent with the first five primary constraints. Had 
this been done, the resulting Dirac Hamiltonians (59) and (64) for cases 1 and 2 
would have p x , p y , p^ and p^ instead of Ap Xg, X g and Ap respectively. It is a 
relatively easy matter to show that the difference between these Hamiltonians does 
indeed have vanishing first order variation. Consider, for example, the difference 
D^ between the p^ terms: 


2 

D 1 = p Al {* X 3 " p P ~2 + ( X 4 “ 


T)' Sxdy 


for which the variation is 


2 

£>rl + ( X 4-P 'r? 


w 9x3y 




Z S 2 

S CIX 


+ ( X 3 - p ^)^6 
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The constraints <p 3 and <p^ guarantee that the first bracket vanishes and the con- 
straint <pg guarantees that the term vanishes independent of the variations in 
p xr Xg, p^, x and y. The remaining terms in the difference of the two H^'s 
are treated similarly. Thus, the obtained is essentially independent of whichever 
of the two forms outlined above is selected for H^. 


The fact that any term of vanishing first variation can be added to 
without changing its essential character may now be used to transform the 
Hamiltonians (59) and (64) into the same form. This is achieved by eliminating 
the variables 0 and pg. It is readily verified that one of the functions 

S 1 = sin 0 + p^ cos 0 T ,/ p^2 + p^2 (67) 

2 

vanishes for case 2 as a consequence of the x 2 constraint. Further, the varia- 
tion in g is given by 


6 g-L = C si 


sin 0 T 


P> 


1 2 ~ 2 . 
V p 4 + p rj 


() 6p f + (cos 0 r 


Ptl 


V p £ + p v 


2 ) 6p r) 


( 68 ) 


+ (p^ cos 0 - p^ sin 0)60 


and again from the x<> constraint the coefficients of 6 p t , 6 p„ and 6 0 vanish. 

z s *7 

Finally, since any function f multiplied by g will also have vanishing first order 

I 2 2 

variation, it follows that p^ sin 0 + p^ cos 0 may be replaced by ± ^ p^ +p^ 
in the Dirac Hamiltonian (64). Since 0 and hence 0 are undetermined by the 
Hamiltonian (59) for case 1, the same substitution may also be made there. 

The Hamiltonians now differ only in their pg terms, and since the dependence 
on 0 has been essentially replaced by p^ and p^ these terms may be omitted 
without loss of generality. 


Anticipating the results of application of the Maximum Principle, it may 
be noted that for case 2, it will be required that p^ sin 0 + p^ cos 0 must be 
positive. Using this condition, one obtains the Dirac Hamiltonian as 
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d V 3 V . c 6 cos 2 o' /_ 2 . S 

H D = p x « + p y ” - P C ST - p »,87 + m V p 5 + p „ 


^ 2 

- P m P COS 2 « - 1 + P X1 (p ? ^ + P,, 5^-y ) 
3 2 V 3 2 V^ 

+ ^X2 V p £ 3 x 3y + P T] ^ y 2 ^ P x P X3 P y P X4 


( 69 ) 


+ 


c 8 cos g 


m 



P X5 


and, finally, at this stage the terms in p. . may be omitted in the same way as the 

A l 

Pq. All of the essential information is carried by the state variables, their 
momenta, and the control variable o; with the Hamiltonian 


„ _ 3 V 3V 

D P x ^ P y ^ P £ 3 x P Tj d y 


c g cos g 

m 


P^ 2 + P^ 2 ~ p m j3 cos 2 a - 1 


(70) 


which is canonical in all the variables. This is a very compact form for the 
Hamiltonian. It has, however, one disadvantage. The momenta p^ and p^ 
enter irrationally. There may, therefore, be some advantage in retaining the 
dependence on 0, together with the two forms (59) and (64) for the Hamiltonians 
corresponding to cases 1 and 2, respectively. 


V. INCORPORATION OF THE MAXIMUM PRINCIPLE 
IN THE DIRAC FORMULATION 


The Dirac Hamiltonian obtained for the time optimal problem described 
in Section IV was written in a number of different forms. It was noted that the 
terms in the momenta conjugate to the state variables were just the Pontryagin 
Hamiltonian 
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( 71 ) 


fM. _Cj8 


H P P X ^ + P y^- p 4Vax m 


cos 


sin 0^) 




so that corresponding to Eqs. (59) and (64) 

s2, r ^2, 

„ * ;+XJ 
9x 


„ _ „ , ^ d?v_ "\ 

H D " H P 1 P X1 V x 3 a 2 X 4dxdyV 


+ P X2 C*3 a xly + X 4 ^ y 2 ) X 1 P X3 *2 P X4 
2 

+ c - ^ c ° s — 01 (X 3 sin 0 + ^ cos 0 J p^ 5 + u n p Q 
m ’’ J 


(72) 


with 


undetermined for case 1 

(73) 

- X, cos 0 + X 2 sin 0 

u, , = -r : — 5 — c 5- for case 2 

11 Xg sm 0 + X 4 cos 0 

Now the Pontryagin principle requires that H p be maximized with respect 
to the control variables. Since the only way in which H p and H p differ in their 
dependence on the control variables is in the p^. term in H p , and since p^ & 
vanishes, maximization of H p with respect to the control variables implies the 
corresponding maximization of H p and conversely. The first condition for 
maximization is that 


3H p ^H D 
3a 9a 

9H p _9H D 

T0 _ ~T0 _ 


(74) 
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These conditions are guaranteed for which has been so constructed that the 
Hamilton equations will yield vanishing time derivatives for and pg, the 
momenta conjugate to a and 0. It was these conditions which led to the secondary 
constraints with five cases to be considered. Only the first two cases, corres- 
ponding to the conventional Pontryagin formulation of the problem, have been 
analysed in detail. 

In the conventional treatment, the bounds on ifi are not explicitly written 

into the Lagrangian. To obtain the conventional Pontryagin Hamiltonian, one 

9 

could just omit the cos a factors in Eq. (71) and apply later the condition that 
the fuel flow, represented by 0 has lower bound zero and upper bound, say, 

^max’ Thus > the conventional Pontryagin Hamiltonian can be written as 

H p = p x 4 + P y 77-P c ||-P„|y + ^f(p ? s in e + P^°°sO-P m « < 75 > 


with 


0 £ a £ 0 ( 76 ) 

H 'max 

In this form H p varies linearly with 0 and hence the maximum of H p with respect 
to 0 will be on one of the bounds, and which bound is to be used will be determined 
by the sign of the switching function 

k = m ( p £ sin 6 + p n cos 6 ) “ p m (77) 

according to the criterion that 


0 = 0 


k£0 


0 = 0 


max 


k^O 


( 78 ) 


The maximization with respect to 0 requires that 
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( 79 ) 


= — (p COS 0 - p Sin ©) = 0 
5 0 m vr£ T] ✓ 

2 

^-5- = -$£ ("p sin 0 + p_ cos 0 ) ^ 0 
50 2 m ' 

The first of these conditions implies that 

p^ sin 0 + Pt? cos 0 = ± J p^ 2 + p ?? 2 (80) 

and the second requires that the + sign be used in Eq. (80) for /3 ^ 0. 

It will be noted that the Dirac formulation with the bounds on rh included 
in the Lagrangian requires (for cases 1 and 2) that the bounds of the fuel flow be 
used and that Eq. (80) hold. The selection of the positive sign in Eq. (80) and 
the operation of the switching function according to Eq. (7 8) are the essential addi- 
tional information obtained from the Maximum Principle. It should be mentioned 
that if the bounds on m were explicitly included in the Pontryagin formulation 

o 

(i. e. , by writing the constraint onm as (3 cos od the same five cases for investi- 
gation would appear as for the Dirac theory. 

The analysis of this time optimal transfer problem has shown that the 
Dirac formulation can be used instead of the Pontryagin formulation and that 
the Maximum Principle can be applied to the Dirac Hamiltonian. It is shown 
in Section VI that these conclusions can be extended to a general class of optimiza- 
tion problems. 


VI. THE DIRAC FORMULATION FOR A CLASS OF 
OPTIMIZATION PROBLEMS 

The construction of the Dirac Hamiltonian for application to more general 
optimization problems is not difficult to carry out. Suppose, for example, that 
the optimization problem is to minimize the time integral of a function f Q (x,y) 

where x represents the state variables x^, Xg x^ subject to the differential 

constraints 
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(81) 


k i = \ ( x »y) 


and y represents the control variables y^, y 2 , ... ,y^. It will be assumed that 
any bounded control variables are replaced by an expression of the form 


min 


cos a + y. 


max 


. 2 

sin a 


(82) 


where v . and y are the bounds on the control variable. A similar form 
■'rain J max 

will be employed for any bounded state variable with the differential equations 
suitably rewritten in terms of the parameter a. Thus, it may be assumed that 
the state and control variables are all unbounded. 


Introducing Lagrange multipliers, the Lagrangian for the optimization is 


N 

L = f Q (x,y)+) X. \ 

i=l 



(83) 


with coordinates x^, x 2> . . . ,x^, y^, y 2 , . . . ,y K „ X^, X 2 , . . . Xj^. The momenta 
conjugate to these coordinates are 

p . — X. i - 1,2,3, ... ,N 

XI L 

P\i = 0 (84) 

Pyj^^. 0 k ~ 1,2, ... ,K 

It is convenient to write the corresponding primary constraints in the form 

*1 = p xi - \ = 0 

♦j - P w - 0 (85) 

"k =p yk =0 

As before, the function is defined by 
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( 86 ) 


N 


H 




i=l 


and the Dirac Hamiltonian is given by 
N N K 

H D = H 1 + 1 U i <0 i + ) Vi + 1 W k W l 


(87) 


i=l 


i=l 


k=l 


where the u's, v’s and w's must be suitably determined from the requirement 
that the time derivatives of all primary and secondary , constraints must vanish. 
To obtain the time derivatives of the primary constraints, use is made of their 
P. B. ’s among themselves and with H^: 

[«v “k-Hjv *j] = [ w k- “J =o 

frv = r - 6 ij 

r ”i v s f j 9 f o 

L«V H lJ = - Z X j^ i + sZ (88) 

i 1 1 

H i] = - *i 

r n r 9f - M n 

L"k> H i] = -l x jai + 5K 


J 

from which one readily obtains 
N N 


K 


1 u jbv < ' , j] + L v j[«v ^j] + Z w kti’ w kJ - - v i 

j=l j=l 


k=l 

K 


N N 

1 u j k- i>j 1 + 1 v i [*i- *) } * 1 w k [*t- “k i = u i 
j=i j=i 


k=l 


(89) 


(cont'd on next page) 
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N 

N 

Is 


3=1 

j=i 


v a£ i 

• 

( 0 . — ■ 
■1 

- T i-A x j at 


(89) (cont'd) 


2). = u. - f. = 0 

^iii 


r . 3f. Sf n 

£b =.Sx.^ + r -2-=0 

l L 3 9y £ by^ 


From the £i and ty equations one obtains the u's and v’s: 

sf. af A 


V i " L X j 3x. + Sx. 

j 


“i = t i 


The to equations do not contain any of the undetermined multipliers u., v., 
and hence are secondary constraints x 


-Is 


!i 3f « „ 

3y e 3y « 


whose P.B.'s are: 


-V Pi] 


a 2 f, a 2 f. 


- x s 


, 3 9x i^ 9 


V *lT 


(cont’d on next page) 


42 


so that 


9 2 f. 


Lv w kl _ l X j byiby. + 


B 2 f„ 


? j a *k ay i 9y e Sy k 


>r H i] = 0 


N 


N 


A 


X i"I U iC-I W 


A 


i=l 

N 


j=l 


i ay i a V y i 


y 


Sf. 


K 


N 


B 2 f. 

v,^+> w,_[-^ Xj— L 


9 2 f, 


0 


M M *“ )=X ‘»V*1 »V»I 


(93) (cont'd) 


(94) 


These equations may or may not lead to further secondary constraints depending 
9 2f 

on the r — fr . At any rate, completion of the calculation of the w, and deter- 

dy k Sy £ 

mination of the existence of first class constraints is a routine matter for any 
particular problem. The Dirac Hamiltonian becomes, on using the expressions 
for the u's, v’s, <p's, ip's and co's 

N N N 

h d“Z x i f j- f o + X Vj + i T J*J + X w k“k 
j=l j=l j=l k 


N 

\ ’ 


N 

V 


m L VrVZ S % ~ V 

i=i j=i 

'l [Cl x idr)-d L ] p xj + l w kP : 

j=l i=l J J k 


yk 


N N N , 

r- ^f. y o f 0 

‘ L p xj f j “ f o " L \ p Xj sx7 + L p xj s x. 
j=l i,i=l ] j=l J 


■l 


+ /\p. 


k F yk 


(95) 


= H p + terms linear in p^. and p yk 
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where some of the may vanish and others may be indeterminate, indicating the 

presence of first class constraints. The function H p is 

N 


H P=- £ 0 + >_ p xj f j 

j=l 


(96) 


which is consistent with the Pontryagin formulation. 


This Dirac Hamiltonian may be used in place of the Pontryagin Hamiltonian 
in the Maximum Principle, since any contribution of the terms in p^. and p yk in 
the application of this principle will contain p^. or p^ k as vanishing coefficients. 

Vn. HAMILTONIAN TECHNIQUES FOR THE SOLUTION 
OF OPTIMIZATION PROBLEMS 


In the preceding sections a Hamiltonian formulation for optimization prob- 
lems has been developed. It has been applied to a particular optimization problem 
and it has been seen that the Maximum Principle can be incorporated in the formu- 
lation. Further, it has been shown that this formulation can be generalized for 
other optimization problems. In this section a perturbation theory for the solution 
of optimization problems is outlined. First, however, one comment should be 
made on a defect of the method. 


This defect is that the constraints on the initial and final values of the 
state variables have not been explicitly incorporated in the formulation. Just 
how this might be done is far from clear. It may, however, be noted that in- 
corporation of the bounds on fuel flow leads to secondary constraints which imply 
that the fuel flow operates on its bounds for cases 1 and 2 without recourse to the 
Maximum Principle. Explicit inclusion of constraints on the initial and final 
values of the state variables might lead to additional secondary constraints on 
the control variables which would automatically fit the final solution of the 
Hamilton equations to initial and final values. It will be recalled that, in 
addition to cases 1 and 2, which have been discussed in some detail, cases 3, 4, 
and 5 may occur. These cases probably correspond, in some sense, to singular 
solutions of the problem which are significant only for particular sets of initial 
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and final values. Their treatment and interpretation would be greatly clarified 
if the initial and final values were made an integral part of the formulation. 

It should be mentioned that the theory developed in this report assumes 
that a complete set of initial and final values has been imposed on the state 
variables. No difficulty is anticipated in relaxation of this limitation. Incorpora 
tion of transversality conditions into the Dirac formulation appears to be straight 
forward. This would, of course, have to be done for application of the theory to 
orbit transfer problems. 

The theory as developed in the preceding sections is in a form particularly 
suitable for the Hamilton-Jacobi approach. The Hamilton-Jacobi equation de- 
rived from the Hamiltonian in the forms (59) and (64) would be a linear first 
order partial differential equation. Neither of these equations separates. One 
could, however, undertake a perturbation procedure and write 


H D H D0 + H D1 


(97) 


with Hj-jq selected to represent a solvable problem. The selection of would 
depend on the particular problem to be solved. In general, one undertakes to 
split Hj-j so that not only is the H DQ problem solvable, but also that H D1 is, in 
some sense, small compared with H^. It would also be desirable to choose 
Hj-jq in such a way that its Hamilton-Jacobi equation is separable. It is not 
easy to satisfy all of these conditions on H DQ , as will be seen from the examples 
discussed below. Considerable further analysis is necessary before a satis- 
factory perturbation theory for optimization problems can be worked out in de- 
tail. Two ways in which the theory might be applied are: 


Low Thrust Problems 

For such problems it is assumed that the maximum thrust is small com- 
pared with the gravitational forces acting on the vehicle. In addition, some of 
the gravitational forces might be small in comparison with others. Thus, 

H D1 might be chosen to include all terms involving B (since if the thrust is 
small, B is small) as well as those terms involving the small gravitational 
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forces. Then H DQ would represent the optimal trajectory for a vehicle moving 
under a gravitational force derivable from a potential V Q . If the potential Vq 
is just the two body potential then H DQ represents the classical Kepler problem 
in a rather unconventional form. For the problem discussed in Section IV, for 
instance, there would be many more variables than are normally associated 
with the two body problem because of the presence of the p's. Further, the 
Hamilton-Jacobi equation associated with H DQ does not separate for this case. 
Since, however, the solution of the two body problem is well known, it should 
be possible to somehow construct a solution of the Hamilton-Jacobi equation 
which could be used as a basis for a perturbation theory for the low thrust 
problem. * 

High Thrust Problems 

In this case one could select H m to include all terms involving V since 
the gravitational forces would be assumed small compared to the thrust. The 
Hamiltonian H DQ would then represent the optimal trajectory for a vehicle with 
no forces other than thrust. The associated Hamilton-Jacobi equation does not 
separate for this case either. As in the low thrust problems, however, the 
solution for the H DQ can be obtained in closed form and is available for use in 
the same way as the Kepler problem for the low thrust case. 

It thus appears that the development of a Hamiltonian perturbation theory 
for optimization problems is feasible. Further work in this area is planned, and 
results will be submitted as they are obtained. 
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INTRODUCTION AND SUMMARY 


This is the final report on contract NAS 8-11020 entitled "Optimum 
Trajectory Study". 

In this section we will try to give a verbal account of the problems 
considered, the reasons for considering them, and the main results ob- 
tained. The remaining sections, while having independent introductions, 
will contain the mathematical analysis. 

The major objective of this study was to examine the use of Hamilton 
Jacobi partial differential equations in determining fields of optimum 
trajectories and to study sufficiency conditions. Since a great number 
of optimal control problems can, with a slight reformulation, be posed as 
time optimal problems, our attention is focused throughout on problems of 
this type. 

If given initial data, say time t = t , state x = x q for a time 

optimal problem, the reachable set (in Euclidean (n+l) dimensional time- 

state space) is defined to be the set of all points (t, x) with time 

t S t and state x such that it can be attained in time t by a trajectory 
o 

of the dynamical system with an admissible control. Under very mild con- 
ditions on the dynamical system equations and the control set, it is 
known that a time optimal point to point transfer will lead to a tra- 
jectory which lies on the boundary of the reachable set. Conversely, 
trajectories which lie on the boundary of the reachable set are excellent 
candidates for being time optimal for some point to point transfer, and 
thus conditions which single them out are of interest. Now a point is on 
the boundary of the reachable set if in every neighborhood of it there are 
points not in the reachable set? i.e., points not attainable by trajectories 
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of the dynamical system. This leads one naturally to notions of 
controllabi li ty . 

Following the definition of Kalman, a linear system is said to be 

completely controllable at time t if every state can be attained 

(with o£ 2 control) in finite time by a trajectory of the system having 

arbitrary initial data (t , x ). Thus one can examine whether the terminal 

o o 

data has been chosen so that the mission is possible. It is of further 
interest to define local controllability, i.e„, a system is locally con- 
trollable along a solution trajectory f^t) if for some t^ =» t all 
points in some state space neighborhood of t ) are attainable in 

time t^ by trajectories with admissible controls. Obviously trajectories 
along which a system is locally controllable cannot remain on the boundary 
of the reachable set, and hence this becomes a test for optimality. It 
might also be remarked that while for linear systems one could expect 
global controllability results, for nonlinear systems it is natural to 
expect only local results. 

In Section I, the Kalman criterion for complete controllability for a 
linear system is derived in a simple manner (corollary 1.1) and an ex- 
tension is obtained for a special form of nonlinear system (Theorem 1.2). 

o 

In Section II, the nonlinear system x(t) = g(t ? x( t) )+H( t,x( t) )u( t) , 
x an n vector, H an nxr matrix, u and r vector valued control with 
1 ^ r ■« n, is studied. If B(t, x) is an (n-r)xn matrix, of maximal rank, 
such that B(t,x)H(t,x) = 0, the local controllability of the above system 
is shown to be closely related to the integrability of the pfaffian system 
B(t,x)dx - B(t,x) g(t, x)dt = 0. In particular, the above nonlinear 
system is defied to be completely controllable if the associated pfaffian 
system is not integrable. Theorem II .1 then shows that in the special 
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case of a linear system, this definition yields a criterion for complete 
controllability equivalent to that of Kalman. This new criterion is use- 
ful since it does not depend on the knowledge of a fundamental solution 
matrix for a time varying linear system. Its use is demonstrated by ob- 
taining the result that an n dimensional system, formed from a single nH> 
order linear time varying differential equation of the form x (t) + 

a, (t) x (n_1) (t) + . . . + a (t) x(t) = u(t), is completely controllable. 

1 n 

(Here u is a scalar valued control). This result was previously known if 
the functions a^(t) were constant. 

The remainder of section II deals with local controllability in a 
neighborhood of singular arcs. It is shown that local tests, which 
depend on examining the controllability of the variational equation along 
a singular arc will always be non-conclusive. Along an optimal singular 
arc the system is truly not locally controllable, however it is shown by 
example (example II. 2) that singular arcs can exist along which the system 
is locally controllable. These can be thought of as inflection points in 
function space, of the functional (time) which is to be extremized. They are 
analogous to inflection points which arise when extremizing a real valued 
function F on a manifold in Euclidean space; i.e., non-extremal points at 
which the map F induces on the tangent space of the manifold into the tan- 
gent space of the reals, vanishes. 

These arcs are singular also in the sense of the classical calculus of 
variations, hence the Hilbert differentiability condition fails to hold along 
them, and classical sufficiency conditions fail. 

In section III, the study of feedback control via the Pontriagin maximum 
principle and Hamilton Jacobi theory is begun. Often the feedback control 
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which the maximum principle prescribes, is discontinuous in the state 
variables, which in turn leads to a Hamilton Jacobi equation with dis- 
continuous coefficients. This is impractical both from a theoretical and 
computational viewpoint. The first part of section III deals mainly with 
the reason for this discontinuity, and yields conditions such that the 
maximum principle would prescribe a continuous or even C 1 (once continuously 
differentiable) control. Theorems III. 4 and III. 5 then show that whenever a 
control problem merely satisfies the conditions of Fillipov for the 
existence of an optimal control, there exists an approximate problem (the 
precise definition of this precedes theorem III. 4) for which the maximum 
principle gives a C"* - control, and such that for any given € =- 0, an 
optimal trajectory of the original problem will be in an 6 neighborhood 
of that for the approximate problem. 

The remainder of section III deals with the Hamilton Jacobi theory for 
these smooth approximate problems, and for the special case of the control 
appearing linearly, an easy construction for the approximating problem is 
shown, while an example (example III.l) is worked out in detail to 
demonstrate the results. 

Two sets of references are given, the first for sections I and II, the 
second for section III. 
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CONTROLLABILITY AMD THE SINGULAR PROBLEM 


INTRODUCTION TO SECTIONS I AND II 

The concept of complete controllability of linear systems was 
introduced by R. E. Kalman [l]* It is part of the purpose of this 
paper to extend the concept to nonlinear systems, with control appear- 
ing linearly. All systems considered are of this form. 

Geometrically, a linear system is completely controllable at time 
t if any state can be attained in finite time by a trajectory of the 
system having arbitrary initial data x q at time t Q . The motivation for 
the extension of this concept to nonlinear systems came largely from 
results obtained in £2} and from the geometric interpretation of non- 
integrability of pfaffians given in [^3] and [^]» In particular, Cara- 
theodory gives an argument to show that if, for a single pfaffian equation, 
there are points in every neighborhood of a given point which are not 
"reachable" from the given point by curves satisfying the equation, the 
equation is integrable. This result was generalized to systems of 
pfaffians in [4]. There is a difficulty in applying these ideas to 
pfaffian systems which are quite naturally associated with control systems 
having control appearing linearly. (See § II.) The reason for this is that 
usually the independent variable t appears explicitly in the pfaffian 
system, hence its integral curves, which can be related back to solutions 
of the control system, and are used to connect neighboring points to a 
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given point, must have t parametrized as a monotone function of 0" « 

This is not the oase in the proofs in £3] and £4 j , and with this restriction, 
in general the results of these papers are no longer valid. 

The relation between singular problems and controllability arises 
quite naturally from the pfaffian approach and can be anticipated from 
results obtained by LaSalle in £;>]• In § II we define the concept of a 
totally singular arc, i.e,, an arc satisfying the differential constraining 
equations, for which there exists an adjoint vector such that the maximum 
principle yields no information as to the optimality of any of the com- 
ponents of the control along this arc. In particular, if the system 
were linear and admitted no totally singular arc, the system would be 
proper in the sense of LaSalle £5] and completely controllable in the 
sense of Kalman [bj. Even if the controls are merely restricted to be 
oC 2 (Lebesgae square integrable) functions, it is shown that totally singu- 
lar arcs can exist, and comprise some or all of the boundary of the attain- 
able set, thereby being optimal trajectories for certain time optimal 
control problems. These are also precisely the arcs along which the system 
need not be locally controllable, i.e., if we assume initial data x q given 

at time t , there may exist points in every state space neighborhood of a 

G 

point of a cctaiiy singular arc lp V , which are not attainable in 

time t^ > t Q by trajectories of the system with controls. Here Lf? 
denotes the solution of the system with control v. Precisely, if for every 
t^ > t Q there exist points in every state space neighborhood of Ip (t^), 
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which are not attainable with control in time t^, the arc (j? is totally 
singular. However it is shown by example that there do exist totally 
singular arcs about which the system is locally controllable. 


§ I. COMPLETE CONTROLLABILITY FOR LINEAR AND MILDLY NONLINEAR SYSTEMS 

Throughout this section H will denote an nxr matrix valued function 
of t, which is in [t Q , t^ ] for any given finite tj > t Q . - Controls 

will be 3^2* ve ctor valued functions, We begin with the following basic 
Lemma. 

Lemma 1 . 1 A necessary and sufficient condition that there exist an 
rxn matrix valued function V(t) in £t Q , t^ J , such that for some 

f * 1 

t 1 > i 0 ,J n(T)v(T)dT is non* singular, is that for some > t^ 


rh in 

I H( 7 ' )II (T )d T is non-singular. 


ip 

Proof Sufficiency is immediate by choosing V(T) = H (T)» To show 

rt 1 

necessity assume there exist V, t^ > t^such that 1 h(T)v(T )dT is 

r t J t 

f . T ° 

non-singular, but j H(7 )H (T )d T is singular for all t -> t , in 
_ J t 

particular t - t^» ° This implies there exists a constant vector c =/ 0 

1 I(T)H T (T )dT j c T = 0 , and since H( 7 *)H^( 7 ") is positive 
smi-dof ini te , we obtain cli(t) - 0 almost everywhere in [t^, ] . Thus 


such that c 
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f 1 

/ H(T )v(7')dT' = 0 which contradicts the non-singularity of 
■'t 


f 1 H(T)v(T)dr. I 


We next consider the system 


(l-l) x(t) « H(t)u(t) , x(t Q ) = x q , uf ^ [t Q , t 1 ] . 


Define ^ 

M(t o , t x )= / 1 H(r)H T (T)dT . 

t 

o 

Theorem 1.1 A necessary and sufficient condition for the system (l-l) 
to be completely controllable at t Q is that there exists t^ > t Q such 
that M(t Q , t^) is non-singular. 

Proof s (Sufficiency) Let x be any given point in E n , Euclidean n 
space. We will show x is attainable from x^ at time t^. Indeed pick 

u(t) = H T (t) £ , ^ £ E n . We desire x » x(t^) = x(t Q ) + ^ J H (T )H T (7" )dT^£ 

or £ = M” 1 (t oS t ] ) (x - x(t Q ))o 0 

(Necessity). Assume M(t o , t^) is singular for all t^ > t Q . This 

implies (see proof of lemma I.l) that there exists a constant vector 

c ^ 0 such that c H(t) s 0 p.p. Since x q is arbitrary, let it be such 

that c » x » Go We will show the point c is not attainable from x . 

0 t 0 

Indeed suppose for some u and t^, c = x q + J * H(T)u(T)dT . Then 

o 
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f 1 

c.o=||c|| = c o x q + c J H(TMT)dT - 0, a contradiction to 

t 

o 

the fact that c ^ 0. | 

Corollary 1.1 (Kalman) The linear system 


( 1—2 ) x(t) = A(t)x(t) + H(t)u(t) ) x(t Q ) = x q 


is completely controllable at t if and only if 

f*l 


J § (t o ,r )h(t )h t ct ) } T (t 0 ,r)dr is non-singular for some t^l> t Q . 


Here J (t,T* ) denotes a fundamental solution of the homogeneous system 
x(t) = A(t) x(t). 


Proof 8 Hake the transformation y(t) = <| *(t ^ t Q ) x(t). Then x 

satisfies ( 1—2 ) if an only if y satisfies 


(1-3) y(*) - $ (t Q ,t) H(t)u(t) s y(t Q ) = x q . 

(Note $> (t Q » t) = (j> ~^(t, t Q ).) From the transformation, it 

follows that the system (l~2) is completely controllable if and only if 

the system (l~3) is completely controllable, i*e«, from theorem Id that 

there exists a t.> t such that 
1 o 

/ $ (t ,T) H(T ) H T (T ) $ T (t ,T)dT is non-singular. B 

J t c o 

o 
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Some special results for nonlinear systems 


We next consider the nonlinear system 

(1-4) x(t) - g(t, x(t)) + H(t)u(t), x ( t 0 )= X o 

with the assumptions? i) | g^(t 9 x) | < M, j = 1, 2, n. 

ii) | g'ktjx) - g^(t, x)| < m ||x - x|| , j = 1, 2, n, iii) g is 

continuous as a function of t for each x 0 

Again let M(t ,tj = [ 1 K(T)K T (T)*T . 

0 x J \ 

o 

Theorem 1,2 A sufficient condition that the set of points attainable 
by trajectories of the system (1-4) with control be all of E n is 
that M(t Q , t^ be non-singular for some ^ > t Q o 

Remark Rather than state the theorem in this manner, one might con- 
sider merely saying that the system (1—4) is completely controllable at 
t « However, this notion has not been defined for nonlinear systems, and 
it does not seem reasonable to this author to define it in such a global 
fashion for these systems® 

Proof For arbitrary u, (l-4) has a solution designated which 
satisfies 

t ^t 

(1-5) tf u (t) = * o + [ el.T,tp\T))it +J H(T)u(T)dT. 

t t 

o o 

Let x he any given point in E n 0 We desire a control such that for some 
point finite t^> t Q , ^(t^) = x„ It suffices to consider controls which 
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come from a finite dimensional subspace of in particular the controls 

considered will be of the form u(t) = H T (t)f where f 6 E 11 . Hence the 
notation (p ^ rather than (p U will be used. 

Define a mapping 3^8 E 11 — ^-E 11 as follows? 

. rh f 

Let ct (f )=J g(T t (T ))dT , 2 nd define 
*o 

<&(£) = H“ 1 (t Q » t 1 ) [ x - cK,(f ) - x q j . From ( 1—5 ) it follows that 
a fixed point of 3T will yield a value £ such that (p^ (t^ = x. 

It is well known that with the conditions imposed on g ^7» th. 7 .k - 
Chapter i] , (p^is a continuous function of f in the topology c[t Q , t^J , 
i.e., the topology induced by the supremum norm. Thus <X(f ) is a continuous 
function of f , and^is a continuous function of f . 

We next show that there exists a K such that If |S K~l*(f)ll<K. 


Letting ||f | = I |f | , and || M _1 

i=l 1 

for any f. Ik(f)l s »(‘i - v m ' 


be any matrix norm, since 




Letting K = M _1 (t , t 


x + nM(t. - t ) + I x 


, it follows that 


for any f. Il*'(f.)|| < K, hence in particular maps the ball 
{f e II f II < * J continuously into itself. Thus 3*" has a fixed point. | 

Remark The result obtained in this theorem is not surprising in view of 

theorem (i.l) and the boundedness condition on the vector g. Also the 
condition M(t o> t^) non-singular for some t^.> t Q is much stronger than 
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it need be. For example, if we consider a linear system of the form (1-2) 
and H(t) is a column vector with one component zero, then M(t o ,t 1 ) is 
singular for all t^ > t Q , yet the system can certainly be completely 
controllable. 


■§ II. NONLINEAR SYSTEMS WITH LINEAR CONTROL; THE SINGULAR PROBLEM 


In this section, we consider extending the notion of complete con- 
trollability to systems of the form 

(2-1) x(t) = g(t, x(t)) + H(t, x(t))u(t) 

where g is an n-vector, H an nxr matrix, while u is an ,5^ control vector. 
It is assumed that g and H are in all arguments. Throughout, the 
stipulation 1 < r < n is required to hold. 

Let B(t, x) be a c\ (n-r)xn matrix with rank (n-rank H) at each 
point (t, x) in some domain^, of interest, such that 

(2-2) B(t,x) H(t, x) ss 0 , (t, x)(f^. 

Since r < n, we know that rank B >1 for all (t, x). 

With the system (2-1), associate the pfaffian system 


(2-3) B(t, x)dx - B(t» x) g(t, x)dt = 0. 


V 


Let b be an arbitrary linear combination of the rows b of B, 

taken with C; scalar valued coefficients x), i.e., 
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Throughout, b will be used to 


b(t, x).][ C<y (t, x) b^ (t, x). 

V 

denote such a linear combination which is not identically zero. 

Definition II. 1 The pfaffian system ( 2— 3 ) is integrable at the point 
(t, x) if there exists a scalar valued function V|/(t, x) and an 
€ •> 0 such that for some b, 

y^Xt, x) = b(t, x), V|/ t (t, x) = -b(t, x) • g(t, x) 

for t < t < t +€ , | x - x |<C. 

Essentially this states that for some b, 

(2-4) b(t, x)dx - b(t, x) • g(t, x)dt 

is an exact differential in a "neighborhood'* of (t, x). It should be 
noted that any integrating factor can be included in the coefficients 
of the linear combination of the rows b ^ . 

The notion of integrability of a pfaffian system is, of course, 
related to the property of completencess of an associated system of partial 
differential equations. To show the relation, let C(x), i ( E 11 , be a 
smooth (n-r)xn matrix, and K(x) a smooth nxr matrix, both of maximum 
rank, such that C(x)K(x) * 0. With the pfaffian system 

(2-4) C(x)dx = 0 

T f'( x) 

we associate the system of partial differential equations K (x) = 0. 
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Indeed, since all points in some neighborhood of x q are attainable by 


64 



absolutely continuous curves satisfying C(x,(t)) x(t) - 0 almost every- 
where, we must only show that such a curve also satisfies (2-5) for some 
control u. But C(x(t)) x(t) = 0 =^ x(t) is a linear combination of the 
columns of K(x(t) ) , since CK = 0. Thus there exists u(t) such that 
x(t) = K(x(t))u(t) for almost all t. Since K has rank r, it has a con- 
tinuous left inverse on its range, from which it follows that u is 
measurable. 

Before stating an explicit criterion for complete controllability of 
a system of the form (2-l) one may ask* What should one expect the 
definition to yield? This can presently be answered as follows. Since 
the definition should extend that given for a linear system of the form 
(l-2) which is a special case of (2-l), one expects: 

a) If g(t, x) = A(t)x, H(t, x) = H(t), then the criterion which 

defines complete controllability at t for (il.l) should be 

t ° 

equivalent with the condition f 1 $ (t Q , t)H(t)H T (t) § T (t Q , t)dt 

J t 

o 

non-singular for some t^ > t Q , as given in corollary 1.1. 

b) There should be a geometric interpretation of the condition, 
e.g. , what points are attainable from the initial point in finite 
time? In the linear system there were global attainability 
results, i.e., any point could be attained from the initial 
point via a trajectory of the system. In the nonlinear problem, 
one would expect at most local results of this nature. 
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The approach will he to state a criterion for complete control- 
lability of (2-l) which we will show satisfies a). We then use this 
criterion to try to establish a geometric interpretation as mentioned 
in b). Of course, how the definition of complete controllability should 
be extended is somewhat a matter of personal opinion. 

Definition 11.2 The system (2-l) is completely controllable at 

("t, x) €<# if the associated pfaffian system (2-2) is not integrable at 

(t, x). 

It will next be shown that this criterion is equivalent to the con- 
dition given in corollary 1.1 for the special case of the linear system 
(1-2). In this case it suffices to take B = B(t) in forming the pfaffian 
system equivalent to (2-3). Also, in taking the linear combination of the 
rows of B to form the single pfaffian as in (2-4), we can consider the 


scalar functions 



as function of only t. 


that if the pfaffian form 


Indeed we must only show 



b(t)dx - b(t) A(t)x dt 


has an integrating factor, then this integrating factor, denoted by , 

can be taken as a function of only t. To obtain this, suppose ^(t, x) 

is such that ( t , x) b(t)dx - y<(t, x)b(t)A(t)x dt is an exact 

differential. Then U b^ - u b*^= 0 for all i, j = 1, 2, ..., n, and 

r x i x i 

• __ * , 

b = b A x -^b A. Define^t(t) =»^((t, 0), noting that for 
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the linear system $ = C^,** 0 ) x E 11 which implies (t, a)eM for t > t Q . 

It follows that^(t) is also an integrating factor. 

Since it is sufficient to consider both ^ and the as functions of 

only t, there is no loss of generality in considering that if the pfaffian 
system 

(2-7) B ( t ) dx - B(t) A(t)x dt = 0 


associated with ( 1—2 ) is integrable, then (2-6) is an exact differential. 

Since x appears linearly, definition II. 1 simplifies for such systems, 
and iss The pfaffian system (2-7) is integrable at the point t if there 
exists a C 1 scalar valued function ^(t, x) and an €;> 0 such that- for 
some b . 

V x (t> x) = b(t) , H/ t (t, x) = -b(t) A(t)x 

for t <T t <■ t + 6 . (Notes Under the assumptions on B and H, ^ 

3X1(1 ^x exist and are equal). 

Define « 

W(t c , S) = J l(t o , t)H(t)H T (t) $ T (t o , t)dt . 

"t 

o 

Then corollary 1.1 states that the system (l-2) is completely controllable 
at t Q if and only if there exists a t ^ >■ t Q such that W(t Q , t^) is non- 
singular. 

Remark 1 . If A and H are constant matrices, Kalman £lj shows that this 
condition is equivalent to the conditions rank [ A, AH,... A n_1 H] = n. 
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Remark 2. While the above condition given for the constant coefficient 


case can be directly checked, W(t Q , t^) depends on knowledge of a 
fundamental solution $(t, t ) which is not always easily obtainable. 

Remark 5 . It is easily seen that w(t Q , t^) is a positive semi-definite 
matrix. Thus if W(t o ,t 1 ) is non-singular, W(t Q , t) is non-singular for 
all t "> t^. 

The main purpose of this section will be to show that the condition 
II. 2 for complete controllability of (l-2) is equivalent to W(t Q , t^) being 
non-singular for some t^ > t Q . This condition has the advantage of not 
depending on knowledge of a fundamental solution. 

Before stating the main theorem, a simple computation yields, 
for t Q < < t 2 , 

v(t 0 , t^) = w(t Q , t^) + $(t^, W(t 1# t^) (t o , t^). 

Thus if W(t^, tg) is non-singular (positive definitive) it follows that 
W(t Q , t^) is also non-singular (positive definite). The reverse im- 
plication need not be true. 

Theorem II. 1 A necessary and sufficient condition that W(t n , t^) be non- 
singular for all > t-, is that the pfaffian (2-7) be not integrable at 

V 

For ease in both using and proving this theorem, we list the implications 
and their contrapositives. 
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I. A Necessary condition ? Vf(t^, tg) non-singular for all tg > t^ 

t > pfaffian (2-7) is not integrable at t^. 

I.Bvf Necessary: contrapositive : Pfaffian (2-7) integrable at t^ ==$> 

V(t lf tg) is singular for some tg .> t^. 

I.C Sufficient condition s Pfaffian (2-7) not integrable at t^ 

=^>W(t^, tg) is non-singular for all tg^ t^. 

I.D Sufficient: oontrapositive : W(t^, tg) singular for some tg > t^ 

: pfaffian (2-7) is integrable at t^. 

Proof : (We shall prove I.B and I.D) 

Assume the pfaffian (2-7) is integrable at t^. Then there is 

a vector b, which is a linear combination of the rows of B, and an 

€ > 0 such that b(t) = -b(t)A(t), for t^< t< t^ + € . Let 

$(t, t^); $(t^, t^) = I, be the fundamental solution of x = A(t) x. 

Then the vector b admits the representation b(t) = c F 1 (*» V = 

c §>(t^, t) for some constant vector c. Let h(t) be any column of H(t). 

Then 0 = b(t)h(t) = c $(t^, t) h(t). Since h was an arbitrary column 

T 

of H, and W is positive semi-definite , we have c W(t^, t) c =0 for 
t^<. t < t^ + £" showing that there exists a tg > t^ such that W(t^, tg) 
is singular. 

Assume, next, that w(t^, tg) is singular for some tg> t^. Prom 
remark 3, it follows that W(t^, t) is singular for all t^<! t<Ttg« 

This implies there exists a vector c(tg) such that c(tg)w(t^, tg)c (tg) = 0. 
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Since the integrand of the integral defining W(t^, tg) is continuous, 

c(t 2 ) |(t 1 , t)H(t)H T (t) t)c T (t 2 ) =0 for t 1 < t<t 2 . 

It follows that 0= c(t 2 ) §(t^, t)H(t) = c(t 2 ) §”^(t, t^)H(t), thus b 
defined by b(t) = c(t 2 ) $ \t, t^) is an admissible vector in the sense 
that b(t) H(t) = 0, i.e., b lies in the subspace spanned by the rows of B. 

Define the scalar valued function V[/ (t,x) = c(t 2 ) $”*(t, t^)x. 

Then x ) = b ( t )j x ) = A (t)x for tj.< t < t 2 showing 

that the pfaffain (2-7) is integrable at t^. | 

The following illustrates the advantage of a definition of complete 
controllability for linear systems which does not depend on knowledge of 
a fundamental solution. 

It is known that an n dimensional system which is formed from a 
single n^ order equation having constant coefficients and the control 
as forcing term is completely controllable. We next show that this is 
also true for time varying systems of the form 

x^ n \t) + a^(t) x< n ^(t) 4 ... 4 a n (t) x(t) = u(t). 

Specifically we shall show that for any t Q , the associated pfaffian is 
not integrable implying W(t o , t^) is non-singular for all t^ > t Q . 

We take the equivalent linear system of the form 
y(t) = A(t) y(t) 4 h(t) u(t) where 


0 X OoooooO 


0 

0 0 1 


o 

0 • 

o ® « O 

M3 

li 

« 

o • . 0 


c 

• . 0 


• 

0 0 *1 


0 

“ a n » ” a ri-l > ® ° ° 


1 
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One can choose B(t) as the (n-l)xn matrix 


B(t) 


1 0 0 ... 0 0 

0 1 0 ... 0 0 


^0 0 0 ... 1 oj 

The pfaffian system equivalent to (2-7) is then 

(2-8) dx 1 - x £ cLt = 0 

dx^ - x^ dt « 0 


dx i dt = 0. 
n-1 n 


If (2-8) were to he integrable there must exist scalar valued functions 

£?(.(t), not all zero, so that the single pfaffian 
0 


n-1 


I OfjW 


dx. + 0 dx 


n 


j-1 


n-1 

Y. x j+i dt 


is an exact differential. But this would imply £>C.(t) =0, j = 1, 2, ..., 
(n-l), which shows (2-8) is not integrable for any t Q . 
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Geometric Interpretation, Local Controllability, and the Singular Problem 


By associated a pfaffian system of the form ( 2 - 3 ) with the system (2-l), 
it is conspicuous that the stress is taken away from the functional form of 
the elements of the matrix H, and placed only on what the range of H(t, x) , 
considered as an operator on E , is. This obviously should be the case 
when controls are required to be only functions. 

In [ 9 ]* Markus and Lee consider a system of the form x * f(x, u), 
f e in E n 1 , where £7 a compact set contained in E r with 0 in its 

interior, is the range set of the control. Assuming f(0, 0) = 0 and 
letting A * f x (0, 0), H =. f^(0, 0), it is shown that if the linear system 

o 

x as Ax + Hu is completely controllable, then the set of points from which 
the origin can be reached in finite time by trajectories of x = f(x, u) , 
is an open connected set containing the origin. Kalman [loj pointed out 
that a similar result can be obtained for a system of the form x = f(t, x, u) 
by assuming the linear approximation, is completely controllable in terms of 
the criterion given in cc roil ary 1*1, 

The system 

( 2 - 9 ) x(t) « f (t, x(t ) 9 u(t)) , x(t Q ) = x o 

2 

where x is an n vector, f is a C vector valued function and u is a r vector 
valued measurable control, is said to be locally controllable along a 
solution <f corresponding to control v if for some t^>* t all points in 
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some state space (n dimensioned) neighborhood of L^> (t^) are attednable in 

time t^ by trajectories of (2-9) with admissible control. 

It would be somewhat falacious to say that a time dependent system 
is locally controllable, say at the origin, if all points in a neighborhood 
of the origin in state space are attainable by trajectories of the system 
in finite time. To see this, we consider the following example of G. Haynes. 


Example 1 « 

x 1 = - Xg + (cos t) u , x(o) « 0, |u(t)| < 1 

X 2 m X 1 + (®i n t) u . 

An integral of the motion is seen to be x^ sin t - x^ cos t = 0, which one 
can picture as a rotating (with time) line in x^, x 2 space. As t varies 
from 0 to 2 7T, all points of E 2 are swept out by this line. Now multiply 
the first equation by cos t , the second by sin t and one obtains by adding} 


dt 


(x^ cos t + x^ sin t) = u or 


x^ cos t + x^ s 


integral of the motion gives 

-t 


r* 

xn t ~ I u 

J 0 


(T) dT « Combining this with the 


x x 2 (t) + x 2 2 (t) = 


J »(r>* r 




implying that as time increases, the 


two dimensional neighborhoods of the origin of E which are attainable 
also increase. 
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Since all solutions lie on a surface in (t, x) space, one would 
hardly feel that the system should be termed locally controllable and is 


not locally controllable by the definition given above. 

We next proceed with an analysis, similar to that used in the papers 
[ 9 ] and [lo], to examine local controllability about a given trajectory of 
the system (2-l). Let x(t o ) =* 0 be initial data for this system v an 
arbitrary control and Lf? the corresponding solution. Let u(t;£), 
f 6A be a family of control, ouch that u(t, 0) . v(t), exists, and 
denote x( . j£ ) as the response to u( „ ,£). Then x( . ; £) satisfies 
■ t 


t(t; £) » / g(r, x(T$£)) + H(T,x(T; £)) u(T; £ ) 
J t L 


d 7* . 


£ (t? o)==j Jg x (7» (p(T)) + H x (T, (p v (T)MT ) x^(T, 0 ) 

+ h(T, (p v (T )) (f , 0)dT 


where H v is an nxn matrix with ij® element H 

x ^ / . x 


V 


V - 1 


For each t > t^ 9 we view x(t;£) as a mapping £ x with 

0 — — ■*(p V (t)o Let z(t| u£ ) denote the Jacobian matrix X£ (t$ 0). 


We have? If for some t, u 



u t ) is non-singular, the attainable 


set at t contains a neighborhood of the point ^ )V ( t) . Let $(t, t Q ) be a 
fundamental soluti on matrix of the system 

x(t) =- [g x (* »(^ V (l)) 1 - H x (t, ^ V (t))v(t) j x(t). Then 
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z(t; If , u £)— / K t »T) H(T t ^(T))u£ (T ; o) df . 

Prom lemma 1.1 and oorollary 1.1 we have 

Theorem II. 2 (Kalman) A necessary and sufficient condition that there 

exist an rxn matrix u£ such that Z(t^j (p V , u^ ) is non-singular for 

some t_ > t is that the linear system 
1 o 

y(t) = (t)) + H x (t,(p v (t))v(t)] y(t) + H(t, ip v (t))u(t) 

is completely controllable. 

In terms of the pfaffian approach the equivalent theorem is 
Theorem II. 3 A necessary and sufficient condition that there exist an 

rxn matrix U£ such that Z(t^, (p V , u£ ) is non-singular for some t^ > t Q , 
is that the pfaffian system B(ts, (p V (t))dx - B(t,^? V (t)) 

[g x (t, ^ T (t)) + H^(t 9 (p T (t))v(t) ] x dt = 0 be non-integrable, for some 

t^^ t o , i.e., that 

(2-10) b(t, lp v (t))dx - b(t,(^ V (t)) [g^t, y? V (t)) + H x (t, <p(t)) v( t ) ) J x dt 

is not and exact differential for any b which is a linear combination of 
the rows of B. 

The same method, when applied to a system of the form (2-9) yields 
Theorem II, 5 * A sufficient condition that there exists a t Q such that 

all points in some state space neighborhood of (p V (t^) for all > t^ are 


75 



attainable in time t^ by trajectories of (2-9) with admissible controls, 
is that there exists a > t such that the pfaffian system 

B(tj v)dy -= B(t$ t) f x (t, ty? Y (t), v(t))y dt = 0 

is not integrable at t^. [line notation B(t$ v) is used to denote the 

dependence of B on the reference trajectory, specifically 
B(tj v) f u (t,(^ V (t) s v(t)) = 0 ,] 

It is interesting at this point to see the implications of the 
assumption that (2-10) jLs an exact differential* This implies and is 
implied by 

(2“H) ~ b(t, (p V (t))= -b(t,^? Y (t)) [ g x (t, ^7 V (t) + H x (t,(p V (t))v(t)] , 

which can be recognized as the so-called adjoint system of the maximum 

principle [ll] approach to the time optimal problem for system (2-l). 

It should be noted that if b(t,^> Y (t)) satisfies (2-ll), then it is an 
adjoint vector which is orthogonal to all of the columns of H, Since the 

maximum principle (for control components bounded by one in absolute value) 

n 

impliess choose u^(i) « sgn £ b^(t, ^ Y (t) )H x ^(t, to v (t) )s in this case 

it yields no information* 

I shall designate such a problem as one which admits a totally 
singular arc ip * , i,e», where the maximum principle yields no information 
in the time optimal problem , for any components of the optimal control. 

The arc would be singular, but not totally singular, if there is an adjoint 
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vector orthogonal to some, but not all columns of H. 

Theorem I I . k The pfaffian form (2-10) is an exact differential if and 

only if Ip is a totally singular arc. 

Proof s It has been shown above that if (2-10) is an exact differential, 
then the vector b satisfies (2-11), which implies is a totally singular 
arc. 


If ip Y is a totally singular arc, there exists a vector p(t) such 
that i) p(t) H(t,p V (t)) =; 0 and ii) p(t) = -p(t) 

k (t * ( t ) ) + H x (t,(p V (t)) v(t)j . Prom i) we conclude that p(t) is 

a linear combination of the rows of B(t, l^? T (t) ) , while Ii) implies that 
this linear combination, (2-10), is an exact differential. J 

V 

To summarize j (p not a totally singular arc implies the pfaffian 
form (2-10) is not an exact differential which implies there exist 

t Q and u£ such that Z(t,p T , u ^ ) is non-singular and the attainable 
set at time t contains a neighborhood of the point p V (t). The contra- 
positive of this statement provides an interesting characterization of 
totally singular axes, i.e., if for every t^ > t Q there exist points in 
every state space neighborhood of (p V (t^) which are not attainable in time 
t^ with controls, the arc. is totally singular. On the other hand, 
as will be shown by example, a totally singular arc can remain on the 
boundary of the attainable set, and thus provide a time optimal trajectory. 
Theorem 11.5 If the system (2-1 ) is not completely controllable at t Q , 
Z(t,p V , u h) is singular for all t t Q , ua and all reference trajec- 

V V ^ 

tories (p , i.e., every trajectory p is totally singular. 
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Proof s Any vector b, which is a linear combination of the rows of 
B, satisfies b(t, x)H(t, x) = 0. Thus for any vector v(t), 

[b(t, x)H(t, x)v(t)] = 0, or v(t) H T (t, x)b x (t, x) = 

-b(t, x)H^(t, x)v(t). Evaluating this identity at the point (t, (p V ( t)) f 
substituting into (2-ll) and expanding of the left side yields 

(2-12) b t (t, <p\t)) + b(t, (p V (t))g x (t, Cp V (t))+g(t, <p V (t))b x T (t, (/(t)) = 

Y(t)H T (t,<p Y (t)) [b x (t,(f v (t) - b x T (t,(^ V (t))] . 

This identity provides a necessary and sufficient condition that (2-10) 
be an exact differential, i.e., that (p r be totally singular. 

Now assume the system (2-1 ) is not completely controllable. This 
means that for some b, a linear combination of the rows of B, the pffafian 
form b(t, x)d_x -b(t, x)g(t, x)dt is sui exact differential, or 

* t (t, x) = -b(t, x) g x .(t, x) -g(t, x) b x T (t, x) 

b x (t s y)~- b x T (t, i)s0, 

Evaluating these two identities at (t,(p V (t)) for an arbitrary control v 
shows that (2-12) is satisfied, hence every trajectory (p* is totally 
singular. 

A conjecture which one might be tempted to make is that if the 
system (2-l) is completely controllable, it admits no totally singular 
arcs. This is not true, as the following example from [2] shows. 
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Example II«1 


“ X- 


u 


x 1 (0) = 1 


x = - x g + u 


x 2 (0) = 0 . 


For the time optimal problem of reaching the point (2, 0), it is shown 
in ^2 j that u = 0 is the optimal control, if the restriction |u(t)| < 1 
is imposed, and it easily follows that this is also optimal in the class 
of controls. 

For this problem, one can use for the matrix B, the single vector 
2 

b a (l, x^ x 2 ). The associated pffafian equation is 
dx^ + x.,^ x 0 dx., (x 2 - l)dt » 0. 

1 1 £. d A £ 

2 2 2 \ 

Let x = (x 2 , x 2 ) t a(x) = (l, x^ x x (x ? - l)),. Then (curl a(x)). 

fj 

a(x) = 2 X thus the pfaffian is not integrable. 

The optimal path from the point (l, 0) to (<sK, 0), <?(>!, is ob- 


tained with control u a C, and is 


J \ - / 


This is a totally s i ngu 1 ar arc. To show this. 


0 . 


we note b(t,(jp°(t)) H. (l, 0). 

b(t,(p°(t))dx -b(t,tp°(t)) [g x (t, ^>°(t)) + H x (t,/ f °(t)) ° 0 ] x dt 


2x 1 

= (3x, +0 dx - dt. 
i c. j-t 


79 



Let a(xj t) — (lj 0, 


i-t 
2 x. 


). Then (curl a), as 0 which implies the 


pfaffian dx^ + 0 dx,, - ■ y -- ” dt = 0 is integrable, and (p° is a totally 
singular arc. Here the arc is on the boundary of the attainable set. 

It should be stressed at this point that it has not been shown that 
if for some control v v the matrix Z(t, tp 9 ) is singular for all t > t ^ 

and u £ then sufficiently small n neighborhood.3 of a point (t) con- 
tain points not attainable in time t, from initial data 0 given at t Q . 

In fact it. will next be shown (Example II.?) that, this is not the case. 


TV 


do this we must produce a time optimal problem which posseres a totally 
singular arc which yields neither a maximum or minimum. Since the arc is 
totally singular, Theorem II. 4 shows that one cannot conclude that the 
system is locally controllable along this arc by considering the linearized 
equations as in Theorem II. 2. However the use of theorem II. 3* on certain 
arcs which differ from the singular arc but have some points in common with 
it, will establish the local controllability. 

We consider control, systems of the form studied in [ 2 j . 


(2-13) 


y(t) - A.j (x(t) ) + B ? (x(t)) u( ',) 


* ? (t) - A 7 (x(t)) + B ,(x(t)) u ( t) 


t( O') =■ 


u(t) <1 


We assume that In some region of interest $ of state space, 


(?~14) A(x.)s* ' (x) + ,fx) A„(x) d 0 

? ' L * £ 0 


arid that A., B „ , i 


are C in Ac. 
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The pfaffian system associated with (2-13) is the single pfaffian 


equation 


(2-15) B^(x) dx-j - B^x) + A(x) = °« 


Since A(x) y- 0 and multiplication by a factor does not change inte- 
grability, this can be rewritten as 


(2-16) 


B 2 (x) 

d c 


B„(x) 

A 


4x, + dt = 0. 


/b 2 (x) BjC*) 

Let Z(x) = | 9 - -gjr^ » 1 J 5 then a necessary and sufficient 

condition that the pfaffian (2-l6) be integrable at a point (t, x) i: 
that Z(x) • curl Z(x) = 0 in a neighborhood of x. Computing yields 


Z(x) ® cirri Z(x) 




B-,(x)' 


d V a T ^ 7 " d> 


B 2 (x)' 

aT^T 


= - A\ x ), 


rom 


1 \ / C7 “2 

where Cc>(x) (using the of M> can be directly computed f ! 

X 

the right sides of tee differ ntial equations 

Let v fee a t Lnuoua control (this is sufficient continuity when 
the control appears linearly) satisfying Jv(t)| <C1 9 and let (p V be th€ 
corresponding trajectory of ( 2 - 15 )* 

Theor e m 1I 0 6 If for some tj > is not a zero of ud 9 then 

for any t^-> t n all points in. some state space neighborhood of (p (t n ) 
are attainable by trajectories of (2-1 j), in time t, „ with admissible 
controls,. 
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Proof 8 The variational equation for the system (2-13) about the 

y 

trajectory (0 is given by 

Kt) - [a^M) * v(t) B x (^ T (t))] y(t) ♦ B«p T (t)) u(t) 

(i)- 


where A 


2 ' 2 
for this variational equation is 


B * f ^ o The pfaffian equivalent to (2-10) 

V B, 


(2-17) B 2 (tp V (t))d yi - B 1 ((^ V (t))cLy 2 + (-B 2 (<p V (t)) t)) ) [k x (Cf V {t)) + 

▼(*) B z Up T (t))] y dt - 0. 

A sufficient condition that ( 2— 1 7 ) be not integrable at t^ is that 

(2-18) £ (B 2 ((p T (t)),B 1 (( < > v (t))) 4 

t.t 1 

v(t. ) B^(^* r (t^ )) ] , which is implied 

byCL>(^ V (t)) 4 0 as can be shown by a s tr ai gh t f o rwar d calculation® 

[in terms of Theorem II.'*, (2-18) states that (p^t^) is hot a point of a 
singular arc. In [?, pg. 97 ] it is shown that for systems of this type 
singular arcs are characterized by the fact that (jD is zero along them® 

It follows that if (p> v (t^) is not a zero of LO, then it is net a point of 
a singular arc, hence ( 2 .- 5 ?) is not integrable and the conclusion of the 
theorem follows.] | 
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It should, be stressed that the integrability of (2-16) requires 
CO (x) = Z(x) • curl Z(x) to be zero in a neighborhood of a point, while 
Theorem II. 6 deals only with the value of U) at a point. It is possible. 
Example II. 1, to have the pfaffian (2-16) not integrable at a point (t, x) 
at which fcc>(x) = 0, and yet have a trajectory (_p V such that (/&) = x 
and the system is not locally controllable about ^> V . 

We next give the example of a problem which is locally controllable 
along a totally singular arc. 


Example II. 2 (A singular arc (p°(t) such that all points in a neighborhood 
of (p°(t^) are attainable in time t^. ) 

Consider the system 

x^ = u | u(-t) | < 1 

x 2 « 1 + x 2 x 1 2 u x(0) =» 0 


p 

Then A(x) = 1, (ij(x) = x. ( 9 hence if we were to consider the time 
optimal problem of reaching the final point x^.(0, }&), the Greens theorem 


approach 



yields the fell owing 



Figure 1 


x 


1 
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the optimal axe being shown by the arrows. There is an arc along whioh 


UJ ^ 0, i.e., x^ 5 0, and while this can be attained with the control 
u = 0 it yields neither a maximum or minimum to the time optimal problem. 
This arc we designate as 


fw . J K « s 0 

(^^2° “ tc 


It is easily checked that the variational equation along (j) is not 
completely controllable. 

Now consider a relation x^ = sin k^ x^» k^, k^ > 0 with > 4 7T • 
It will be shown that for k^ sufficiently small, there exists a unique 
admissible continuous control u(t) with trajectory Cf? U which has 
■£(x^, x^)* x^ = k^ sin k^ x^, x^ > 0 j as its track. 

From the- Greens theorem approach ^2 J and the symmetry of £tl(x) about 
the line x^= 0, the parametrization of (p U must be such that the even 
numbered crossings of the x^ axis, counting only crossings which occur for 
x^ > 0, one must have 

^ ( SalL ) , ° ( SalL ) 


<p u ( isJL ) _ gn.II.. _ f D 0 f 

~2 ' k, > ~ k “ j 2 ^ 


o / 2n TV 


We will be interested in the case n ~ 1, so that 


<l/2. It will 


be shown that there is local controllability along 


and since 
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(p u ( ~~ ) = (p° ( ) , it will follow that a neighborhood of 


tp ^ ) is attainable in time 


2 7T 


First we will show that for sufficiently small, there is a 
unique continuous u which leads to a trajectory lp a having 

*1 " *1 Sl " *2 V X 2 Sl °} as its track. Differential 
of the track relation with respect to t yields 

*l(t) = k x k 2 [cos k 2 x 2 (t)] x 2 (t). 


Substitution from the system equations leaves 


( 2 - 19 ) u(t) - k x k 2 [cos k 2 * 2 (O] [w t)x/(t)u(0] . 
For any control u. 


r 

x,(t) u(T)d X 

J o 



Substituting these in (2-19) yields an expression of the form 
u(t) * kj^u ){%) 

where the definition of the nonlinear operator ^Tis obvious. Let 


Lon 
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C [o, #] denote the space of continuous vector valued functionsu on the 
interval ^0, % j , with the supremum norm, and B^ the closed ball of radius 
in this space. It is easily shown that for ^ sufficiently small but 
positive, u e #=t>k 1 aru e A and k 1 3Tis a contracting map. Thus 
has a unique fixed point in B^, call this point u. Then lp U is not 
a singular trajectory, since k_, positive implies u(t) ^ 0, and (p U has the 
desired track. 

JX 


Now for 0 <2 t^<C , (p U ( t-j ) is not a point of the singular arc, 
hence not a zero of u) • From Theorem II. 6 it follows that all points in 


some neighborhood of £p u (t 2 ), for any t 2 > ^ are attainable in time t 2 by 

27 T 

trajectories with admissible controls, hence this is true for t * — r — . 

u 2 

To determine local controllability along (p by use of the fundamental 
solution of the variational equation about this trajectory would be a 
virtually impossible task. 

In concluding, it should be noted that totally singular arcs were de- 
fined with no mention made of transversal ity conditions. It is possible to 
use these conditions, in very special cases, to rule out the existence of 
singular arcs in the optimal strategy. Also, for a time optimal problem 
for a system of the form 


(2-20) x(t) •= g(x(t)) + H(x(t))u(t) 

the maximum principle yields the fact that the Hamiltonian is constant 
along the optimal path. We shall show that this cannot he used to rule 
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out totally singular arcs, since such arcs automatically satisfy the con- 
dition even though the Hamiltonian is seemingly a function of time along 
them. 

For the system (2-20) with any given control u(t) we define the 
Hamiltonian for the time optimal problem as 

74 (t, x, p)= p . g(x) p . H(x) u(t) + 1. 

A necessary condition is that )/ia a constant along the optimal tra- 
jectory, it need not be so on a non-optimal trajectory. Define the ad- 
joint system as 

(2-21) p(t) « -p(t) g x (x,(t)) -p(t) H x (x(t))u(t) 

Theorem II. 7 The Hamiltonian for the system (2-20) is constant along 
any totally singular arc. 

Proof s We defined a totally singular arc as an arc (p U which 
satisfies (2-20) for which there exists and adjoint vector p(t) satisfying 
(2-21 ) such that p( t )H(^> d ( t ) )= 0 for a set of t values having positive 
measure. Then 

(2-22) ™ >|(t 9 ^? U (t),p(t)) = [p(t) . g((f? U (t)) +■ l] = Pjg 3- + p i s *y *Py * 

From (2-20) g^= <jp. U - H^ k . 

From (2-21 ) p A = - p - p. H ik u . Substituting in (2-22) 

Xy U 1. Xy K 
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■& X- <t.£> U (t),p(t)) = pj^i" - H lk u k ] ■* [-p y - Pi Ilf ujlpj 
* [- »i bUc - p± "k= -•£« [p( t ) H «p U(t), ]J u 


from the condition p(t)H( ^) U (t)) x 0, | 
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III. THE EQUIVALENCE AND APPROXIMATION OF CONTROL PROBLEMS 


INTRODUCTION TO SECTION III 

In this section we will be concerned with the time optimal feed- 
back control problem for an n vector system of the form 

(3-1) x(t) = f(t, x(t), u(t)) 

where the control u is an r vector valued function with values in a 
given set U. The major interest will be in feedback controls. 

One of the difficulties in the theory of optimal feedback control is 
the discontinuity of the control with respect to the state variables, 
which the necessary condition termed the maximum principle, so often 
shows to be the case. Letting H(t, x, p, u) £ p° f(t, x, u) - 1; 
u*(t, x, p) be so that H(t, x, p, u*(t, x, p)) % H(t, x, p, u) for 
all u €. U, and H*(t, x, p) = H(t, x, p, u*(t, x, p)), the Hamilton- 
Jacobi equation approach [l] often leads to a partial differential 
equation with discontinuous coefficients, while the Hamiltonian equations 
of motion which describe the system (the characteristic equations of the 
Hamilton-Jacobi equation) are of the form 

(3-2) x = H*(t, x, p) , p = - ^H*(t, x » p )° 

The maximum principle of Pontriagin, for time optimal problems, assures 
ws that if u*(t) is an optimal control, x*(t) the corresponding optimal 
trajectory, then there exists an absolutely continuous n vector p*(t), 
not identically zero, such that H*’(t,x*(t) ,p*(t)) = H(t,x*(t) ,p*(t) ,u*(t)) 
while x* and p* satisfy equations (3-2). The usual use of the maximum 


(i- w 
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principle proceeds, however, by attempting to generate candidates for an 
optimal trajectory by solving a two point boundary value problem for the 
system (3-2)* Since u* may be discontinuous, the fundamental questions 
of existence and uniqueness of solutions to these equations cannot easily 
be answered. 

An alternative would be to restrict the controls to be continuous, 
or even c\ (continuously differentiable) functions and attempt to gen- 
erate within this class a sequence of controls which will in some sense 
tend toward the optimal control, In doing this, however, one must 
seemingly discard the maximum principle which is one of the most useful 
tools for generating optimal controls, for it so often demands dis- 
continuous controls. 

The approach taken here is not to forcefully restrict the class of 
approximating controls, but instead to generate a class of approximating 
problems whose solutions will be continuous or controls and will tend, 
in a given sense, to the solution of the original problem. 

For the system (3-1) let R(t, x) = } f(t, x, u) : ueu) . We shall 

o 

say that the time optimal problem for a system x = g(t, x, v), v G V is 

equivalent to that for the system (3-1) if {g(t,x,v):v £ vj = R(t, x) for 

all (t,x) in some domain of interest. For given £ > 0 we define the time 

• £ 

optimal problem for the system x = YT (t, x, v) , v £. V( € ) to be an 
£ -approximate equivalent problem to the time optimal problem for (3—1) 
if d([h 6 <t,x,„) : v £ y( € )| , R(t, x^)-c £ for all (t,x) in the domain 

of interest. Here d(Q,R) is the Hausdorff metric distance for sets in 

E*. 

Intuitively equivalent problems have the same optimal trajectories (as 
will be shown) while the optimal trajectories of £ - approximate equivalent 
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problems will be close (uniformly) to those of the original problem. 

It will be shown that under appropriate conditions (essentially the 
Fillipov existence conditions ^2j ) the approximating problems can be 

chosen in such a way that the corresponding feedback controls are con- 
tinuous, or even of class c\ In certain cases this allows the Hamilton- 
Jacobi theory, as derived in |lj , to be utilized for the construction 
of fields of optimal trajectories and optimal feedback controls. 

Although we deal only with the time optimal problem, it should be 
noted that for a problem of the form x'(T ) = f ( T~, x( V ) , u(T")), with 


the functional to be minimized being 



x(0“ ) , u(«T ) )d<T 


where 


the scalar valued function o C satisfies o(_(d" , x, u) i S =* 0, the change 
of independent variable 


t(T) 



, x(c r), u(<r))dcr 


reduces the problem to an 


equivalent time optimal problem for the system 


y(t) = 


| oC(r(t),y(t),u(t)) 


f( T(t) ,y(t) ,u(t)) = g( t,y( t) ,u(t) ) . 


THE MAXIMIZATION OF p.r WITH r IN A STRICTLY CONVEX SET 

Our motivation is to choose approximating problems for which the 
maximum principle will yield smooth controlso Let r*(p) be the function 
which maximizes the functional F(p,r) a p«r for fixed p €. E n - [oj , 
r € R a given compact set in E n 0 We begin by examining conditions on 
the set R which will insure that r* is smooth since it is a maximization 
of this type which causes discontinuities in the controlo 
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Definition . If S is a set contained in E 11 (Euclidean n space) a support 
hyperplane is a hyperplane H which lies on one side of S and SH H / ^ , 
the empty set. 

Definition. A convex set R contained in E 11 will be said to be strictly 
convex if it contains more than one point, and every support hyperplane 
has at most one point in common with R. 

If R is a compact set in E 11 we denote its boundary by c)R. 

Lemma III.l . If R is a strictly convex set in E^, then R has internal 
(interior) points. (This result depends on finite dimensionality). 

Proof Let r , r, € R, r / r n , and V., be the linear variety of 
dimension one determined by these points. Let be any hyperplane con- 
taining V^. Since contains two points of R it is not a support plane 
and there exists a point r e R, £. Let V 2 be the linear variety 

determined by r , r^ and has dimension two. Let be a hyper- 
plane containing Again there is a point r^ €. R» r^ ft M^. We con- 

tinue inductively getting at the (n-l)st step a linear variety V n _^ of 
dimension (n-1) determined by the points r , . . . , r n _^“ Then there 
exists a unique hyperplane ^ containing V n _^, and again a point 

r € R, r $£ M , . Since R is convex it contains the convex hull 
n n n-1 

of the set of points r , . . . , r ; and since the vectors - r Q , 
r - r Q , . . . » r n “ r Q are linearly independent, they determine an 
n cell which has non void interior. | 
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Lemma I II. 2 . Let R be a strictly convex, compact set in E” ♦ Then for any 
fixed pef -[0], the function F(p, • ) attains its maximum value at 
a unique point r*(p) = r Q £ 

Proof For any fixed p, F(p, • ) is a continuous function oil the compact 
set R and hence attains its maximum there. Suppose the maximum is attained 
at an interior point r Q €. R. Let N(r Q ) be a neighborhood of r Q contained 
in R. Then p * r is an interior point of the real interval p • N(r ) = 

« r: r eN(r Q )} , contradicting the fact that F attains its maximum at 

r • 
o 

To show uniqueness, assume F(p, • ) attains its maximum at r Q , while 
/ r Q belongs to R and F(p, r^) = F(p, r 0 ^° Define 
r(oC) = 0 (r + (1 - oO r. , - 00 < 06 < 00 o It follows that 
F(p, r(oO) = F(p, r ) for every such point r(oO® If for some oC * 
r(oO is an interior point of R, the argument of the previous paragraph 
would show a contradiction to F(p, * ) attaining its maximum at r^o Thus 
the one dimensional linear variety V = ^o£r Q + (l-od)r^s - oo-^o^^ ooj 
does not intersect the interior of R, which is not empty by Lemma III 0 1 0 
By theorem 3.6-E pj there exists a closed hyperplane M containing V such 
that the interior of R lies strictly on one side of M. It follows that M 
is a support plane for R 9 and since M contains more than one point of R, 
this is a contradiction to the strict convexity. | 

Theorem III.l Let R be a strictly convex, compa ct set in E n . Then the 
function r*(p) (shown to be well defined in lemma III-2) is continuous. 

Proof Suppose p R *■ p. Since R is compact, some subsequence 

of the sequence [ r *(P n ^ converges to a point of R, and there is no 
loss of generality in assuming it is the original sequence, i.e. 
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let r*(p n ) ► r^. We suppose r*(p) = r 2 / and seek a contradiction. 

From the definition of r*, F(p, r 2 ) =»F(p, r^); let 

F(p, r 2 ) - F(p, = S => 0. 

Since F is continuous there exists an N =» 0 such that 

|F(p n , r 2 ) - F(p, r 2 i^ *7^ and |F(p, - F(p n , r*(p n >|*= ^ 

for n>N. Then F(p n> r 2 > - F(p n , r*(p Q )) = [f(p, r 2 ) - F(p, r.^] 

+ [F(p n , r 2 ) - F(p, r 2 )j + [f(p, r.^ - F(p Q , r* <P n ) )]^ ^/ 2 for 
n >N, a contradiction to the definition of r*(p n ).| 

We next examine when the function r*(p) is 
Definition . For y 6 E 11 , |y| = j ^T* y^ 2 

Lemma III.3 * Let R be a strictly convex, compact set in E 11 which has a 

unique outward unit normal n(r) at each point r£ ^R. Then for fixed 

p €. E 11 - \o) , F(p, • ) achieves its maximum at the unique point 

r € dR such that n(r ) = p/l pi . 

— o o 

Proof Assume without loss of generality that zero is an interior 

point of Re 

For x €, E n , let I(x) = j a: a > 0, a ^*x £ R ] and define 

fo(x) = inf* a. ; (o(x) is called the support function of R, or also the 
a £ I(x) 

Minkowski functionale We note that if r € and y is any vector, then 

o 

ot y + r 

for a real scalar oC> 0, _ o ^ 

£>(ody+r Q ) 

and for oL sufficiently small, is in a neighborhood of r o * 
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From lemma III. 2, we know F(p, • 
point on c)R , let r Q be the point. 


) achieves its maximum at a unique 

Let g(y, r Q ) = lim ^ j ^11 1° 

o6. + — ►O ^(o!.y+r o ) 



Since ^R has a unique outward normal at each point, g(y, r Q ) = -g(-y,r Q ). 
Since P • > P r for all re BR in a neighborhood of r Q , it follows 

that p . g(y, r ) < 0 for all y. Assuming there exists y such that 
P*g(y. r ) < 0 implies p • g(-y, r Q ) =»0, a contradiction. Thus 
P • g(y, r ) = 0 for all y» or a necessary condition that r Q presents 
F(p, • ) a maximum is that p be orthogonal to the support hyperplane at r Q . 
Since R is strictly convex it is easily shown that there are exactly 
two points which satisfy this necessary condition, one with outward 
normal p/|pl giving F a maximum, the other with normal _ p/|pl which 
gives F a minimum. | 

Definition . We say that a strictly convex, compact set R in E 11 has a 
smooth boundary if there exists a unique outward unit normal n(r) £• C 
defined on c) R. (Actually we consider n as a restriction of a C 1 
function in a neighborhood of r£ ^R , see, for example, pg. 27). 

Theorem III. 2 . If R is a compact set in E n with smooth boundary having 
positive Gaussian curvature at all points, then r*(p) £. C . 

Proof Since it is assumed that the unit normal to ^ R is of class C , 
the Gaussian curvature is a continuous positive function on c)R. But ci R 
is compact, thus the Gaussian curvature is bounded away from zero. From 
theorem 5.5 |5 , pg. 35 J it is easily followed that R is strictly convex. 

From lemma III. 5, we have thar r*(p) satisfies n(r*(p)) = p/ | P I - Let 
r Q = r*(p Q ) be an arbitrary point on R . 




96 


The method will be to utilize the implicit function theorem on a 

relation of the form g(r, p) = n(r) - p/lpl . 

Let . . . , be a local coordinate system for a neighborhood 

of r on R. Then the inclusion map from <) R E 11 determines n 

o 

smooth functions x^( C\ <..., )»•••» x q ( £ * • • • * C ) 

or briefly x( £ ) „ Assume x(0) = t q and let be a measurable neighborhood 
of zero in the local coordinate system. 


Let S n_1 be the unit (n-1) sphere; we consider n( • ) : dR- 
Define 9( • ) : V n ► S n 1 by n(x (^)) = 9( £ ) . Thus 


.n-1 


n e c 


^eec 1 . 


Let £^p) = p/lpl , p 6 E? - (oj ; then € C 1 . Our approach 

will be to utilize the implicit function theorem on the relation 

G(C* = ©<f ) 

We note that G £ c\ and if = ^P Q ) ^(0, = 0. Also 

(0, (0)* It must be shown that det(©£ (0)) / 0. 

From differential geometry we recall that as £ varies in x( £ ) 

traces out a region on ^ ® while the normal Q( ^ ) traces out a region 

V on the surface of the unit sphere. Let K(£ ) denote the Gaussian curvature 
2 \ 

of ^R at x( ^ ) , and A, the "area" of V . Then 


a 9(0 

~W~ 




= ky Since is 


A 3 = J K(f)d£ . But J det ^ 

V V 

1 1 A 9(^)\ 

arbitrary (but measurable) and 9 €- C"^, this implies det ^ j - K( ^). 

By assumption K is positive at all points of d R , hence 
det(9£ (0)) / 0. The implicit function theorem now gives the existence 
of a function £ ( such that G( £ ( 9^)» ^) 5 0. 

Then r*(p) = x( f (^(p)))£ c\| 
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The following is an example of a strictly convex set R with smooth 
boundary and a point at which the Gaussian curvature K is zero, for which 
r*(p) is not C*. 

Let part of the boundary of EC E 2 consist of the curve y = x\ the 
rest so as to make R strictly convex and with smooth boundary. We restrict 
our attention to the defined part of the boundary, in particular to the 
point (0, 0) at which K is zero. 

The outward normal is given by (4 x^, -1). Let p = (p^, P 2 ) have p 2 
negative and p^^ near zero. To compute r*(p) 9 (x*(p), y*(p)) we compute 

4 

the point on the curve y = x where the normal has direction numbers 

V3 

(- P l/p 2 , -1). This gives x*(p) = ( _P l/4 p 2 ) , y*(p) = ( P l/4p 2 ) 


and 


o> x*(p) 
d p. 


is seen to not be continuous at p^ = 0 . 
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APPROXIMATION OF OPTIMAL TRAJECTORIES 


The Time Optimal Problem 

Consider the system (3-1), with U a compact set, and initial data 

x(t ) = x . Let S be a smooth (C 2 ) manifold in the (n+1) dimensional 
o o 

(t,x) space with the property that for any t^, ^(t,x)£S: « t 2 t^j 

is compact in E“ +1 . The problem is to find a measurable function u = u(t) 
having values in U, such that the solution of the initial value problem 
for (3-1) with u = u(t), intersects the target S in minimum time; i.e», 
is an optimal trajectory. 

We next give the conditions of Fillipov ^2j , which insure the 
existence of an optimal (open loop) control, and optimal trajectory for 
the time optimal problem. 

Existence Conditions 

( 3 - 3 ) f(t,x,u) is continuous in all variables t,x and u, and is 
continuously differentiable with respect to x« 

(3-4) x • f(t,x,u) 2 C( |x + 1 ) for all t, x, u. 

(3-5) R(t,x) = ^ f ( t,x,u) :u € u| is convex for every t,x. 

(3-6) There exists at least one measurable function u(t) with values 
in U, such that the corresponding solution of the initial value, 
problem for (3-1) attains the target S for some t^ > t Q . 

Equivalence of Problems 

Let the same time optimal problem, as posed for (3-1 ) 9 also be posed 
for the system 

(3-7) x(t) = g(t,x(t) ,v(t)) , v(t) e V, a compact set, 

where g satisfies condition (3*3)* Let Q(t,x) = £g(t,x,v):v €, V j 
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Theorem III. 3 Assume the existence conditions are satisfied for the 


time optimal problem for the system (3-1). Let ^( • ; u*) denote the 
optimal trajectory and u* the optimal control. Then if Q(t,x) = R(t,x) 
for all (t,x), • ; u*) is an optimal trajectory for the time 

optimal problem for the system ( 3 * 7 ) and there exists a measurable 
function v*(t) with values in V such that (t; u*) = g(t, 9?(t;u*) »v*(t)) 
almost everywhere. 

Proof f(t„ ^(t; u*)» u’(t)) is a measurable function of t, with 
values (almost everywhere) in R(t, £^(t; u*)), therefore in Q(t,^(t; u*)). 
From lemma 1 of Fillipov j^J , there exists a measurable function v*(t) 
with values in V such that f { t , '/&( t ju* ) ,u* ( t) )=g( t , </? ( t ;u* ) , v* ( t) ) almost 

o 

everywhere. It follows that ^(t; u*) = g(t, ^?(t;u*) , v*(t)) almost 
everywhere. 

Now if ’ | u*) were not an optimal trajectory for ( 3 - 7) 9 ioe 0 , 

J^( • 5 v) provides a better time, the same argument shows that * ; v) 
is a solution of (3-1) for some measurable control u with values in U, 
thereby contradicting the assumed optimality of • 5 u*)«| 

This theorem stresses the fact that in seeking optimal trajectories, 
it is the set function R(t,x) which is of major importance, not the 
function f(t,x,u) or the control set U* 

When the conditions of theorem 111*3 are satisfied we define the time 
optimal problem for the syst em (3-7) to be equivalent to that for (3-1) *> 

If the existence conditions are satisfied for the time optimal problem, 
from conditions (3“*0 and ( 3—6) we can obtain a compact region of (t,x) 
space to which analysis can be restricted* Indeed for t 2 t — t^ 
condition (3~*+) implies any solution x(t) of (3-1) satisfies 
|x(t)| 2 2 (| x 0 | 2 + 1) exp (2C |tj-t | )o Here |x(t)| stands for the usual 
Euclidean norm* Henceforth, we denote by <£? the compact region of (t,x) 
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space defined by t <2 , |x| 2 « ( |x q | 2 + l)exp (2C |2tj-t o j )• 

Definition . The Hausdorff metric topology for non-empty compact sets in 
E 11 is derived from the following metric: The distance between two non- 

empty compact sets X and Y in the smallest real number d = d(X,Y) such 
that X lies in the d neighborhood of Y and Y lies in the d neighborhood 
of X. 

€ Approximate Equivalent Problems 

Defi"i tion . For given £ ■=*■ 0 the time optimal problem for the system 
x = h e (t,x,v), h^ continuous on E^XE^XVC €. ) , is said to be an £. 
approximate equivalent problem to the time optimal problem for (3-1) if the 
set R(t,x,£) = (h £ (t s x,v): v£V(C)j ID R(t,x) and 

d(R(t,x, £ ), R(t,x))«C for all (t,x)£ A . 

Since h^; (t,x 9 • ) is continuous on the compact set V(£), R(t,x,£) 
is compact* 

Theorem III 0 4 . Assume that the Fillipov conditions (3-3) 9 (3- 1 *) and (3*5) 
are satisfied for the time optimal problem with system equations (3“1)* 

Then for every £ 0 there exists an £ approximate equivalent problem 

° r 

with system equations x = li (t,x 9 v) 9 v€V(£) which satisfies the 
following properties o 

a) The control set V( € ) can be taken to be the unit ball of E 11 , 
which we denote B n <> 

b) h^ is a C°° function on x B n , while for each (t,x)£ , 

h^ (t,x, • ) is one-one on B — E 11 « 

c) The set R(t,x,£) = ^h £ (t,x,v): veB n j has smooth boundary 

having positive Gaussian curvature. 
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d) The (single valued) function v*(t-,x v p) with values in B n which 
maximizes H( t ,x,p»v; £. ) = p • h G (t,x,v) - 1 for each (t,x)£ , 

p € E n - [O } , is C 1 in t,x, and p. Actually v*(t,x,p) €. 'b B n = S n 1 , 
the (n-1) sphere. 


The proof will proceed by obtaining a eimplicial approximation to 8 
in which the diameters of the simplexee are sufficiently small. For 
each vertex (t^, x^) of a simplex, we approximate the convex set 
R(t^, x^) by a strictly convex set Q(t^, x^, €. ) having positive 
Gaussian curvature. A vector function g "(t^, x^; • ) is then con- 

structed so that Q{ t . , x 1 ; ' ) - \ g*' (v„ x^ v) s v 6 B n j , and by use 
of g^ , the set function Q is extended continuously to all of in 
such a manner that for each ( t,x.)C A , Q(t,x; € ) has smooth boundary 

£L 

with positive Gaussian curvature« The desired function h is then ob- 
tained by smoothing the fun:-iio& g'= in the variables (t,x) via the 
Fri e dr i cks mo] 1 1 f i e r tec hr i que . 

Proof is conr i naouBy in the Hausdorff metric topology.,, on the 

compact set- P o Tor any £ ** 0 ;et 5 . ^ 0 be such that 

n+ 2 

d(R(t,x),R(t l ,x"))< f - /8 whenever \\ t ,xM t ’ ,x • )l < b. Let <T be 

6 

any bounded geometric simplex which contains 9 and K be the geometric 

complex consisting of this single ©implex. By bary centric subdivision 

K can be subdivided into a geometric complex K s consisting of a family 
g 8 

of geometric simplexes \ 6’ > , each having diameter less than b° 


Each point 


(t,x) - 


n + 2 

z 


(t 9 x )£.;/)*' has a unique r epresentati on of the form 

ox {i . > ) w; th 0 oC~ ^ 1 9 5 "o 6. = 1; where the 

i i i • i L -> n 


is.i. 


(n+2) points ( t . , x, ) 

, f - r,+ i ) 
the family | 


ar*“ the vertices of the geometric simplex from 
;o which the point. (t-,x) belongs. Without loss of 
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generality we can now consider the union of the members of n | 
which have all vertices in <&' as a new domain of interest; call this 
domain again <A~ . 

Let (t^ be an arbitrary vertex in . Then R(t^, x^) is con- 
vex. Let )l(R(t^, x^), £/4) be a convex C/4 neighborhood of 

R(t ± , xJ. From [6 , pg. 38] there exists a strictly convex set 

Q(t ± , x ± , C ) containing U(R(t i9 xj, C/4) ; having an analytic boundary 
with positive Gaussian curvature, and such that 
d(Q(t i ,x i , € ), r LtR(t., xj, C/4 ))< £ /4. 

For each (t^, x^)C we construct a corresponding set Q(t^, x^, € ) as 

above 0 We next proceed to define a set valued function Q(t,x, £ ) on all 

of jy . 

It can be assumed without loss of generality that 0 C R(t,x) for all 
(t,x) € o Indeed if this were not so, one could choose a point 

u £ U and construct new sets S(t 5 x) « f t\t,x,u) - f(t,x,u . ): u £ U [ 

O v ° ' 

which satisfy this property 0 

1 1 n-1 

Let B n be the unit ball in E* 1 ; S D its' surface and v »...,v a 

coordinate system on S n while v n measures distance from the origin. 

1 2 tl-1 

Then a ray from the origin through (v , v , . . . , v ,1) strikes 

1 n _l 

c)Q(t^, £ ) in a unique point which we denote g^* (t^,x^»v ,...,v ,1). 

This defines g e (t ,x„ , •) on S 11 " 1 ; to extend it to B n let 


l l 


/ J 
v = (v 


* R ) B n „ Define g & (t.., x ± „ v) as that point in 


Q(t i , x.. , £ ) which lies on the ray through the origin and 


( y l 

\» } o o 


n-1 


l) and is such that 
i g € (t 9 x,v) I 


ig e (t,,x,v 


n-1 


= v 
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Then g € (X, x.^, • ); B n — Ql^, £ ) in a one to one fashion. 

We will define Q(t, x, £ ) on all of <A" by extending the definition 
of to all (t,x)£ o 

n+2 

Assume (t,x)€. A. Let (t 9 x) = ^ (t i9 x i ) be the unique 

i=l 

representation of (t 9 x) in terms of the vertices of the geometric 

simplex of K" to which it belongs. Define 
s n+2 

(t,x,v) = ^ oC i g^ (t i# x i9 v), v€ B n . Then if 

i=l 

Q(t,x 9 6 ) * ^g^" (t 9 x 9 v) s v € B n | it follows that; 

V ) 

i) yi(R(t,x) 9 ^/8)CQ(t 9 x 9 €: ). Indeed, from the choice of S , 
H(R(t s x) 9 €/8)C>l(R(t i9 Xi ) 9 6A)CQ(t., x i9 € ) for all 

vertices (t^ s x^ ) of the simplex in which (t 9 x) is contained. 

But Q(t,x,€.) - QC t ± x i9 6 ). Thus if a point is in 

?X(R(t, x) 9 £/8) it is in Q(t,x, C ). 

ii) d(Q(t,x 9 £), R{ t v x)) -C To show this one notes that 

R(t. 9 x. )C ^l(R(t. 9 x . ) 9 €/4)C Q(t , x , € ) for all 

^“1 J J 3 3 

i, j = 1, 2, . . . , n+2. Therefore 

d(R(t,x) 9 Q(t 9 x 9 £) « d(R( t 9 x) , R( t ^ „ x^ ) ) + 
d(R(t, , x, ) 9 Z oC, Q(t ,, x , G ) ) ^/8 + 

j X J J 

max j^d(R(t i? x i >,Q(t , x^ 9 £ )J < G /8 + 

3 

max fd(R(t , x„),R(t 9 x j)+d(R(t , 9 x,) 9 Q(t.„ x. 9 £))]^ 3 € /4„ 

i *• i. x 33 33 33 ~ 
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iii) Q(t,x,€.) is strictly convex, with smooth boundary having positive 

Gaussian curvature, for each (t,x) . Indeed of K(t,x, v^ - , . . . ,v n 

is Gaussian curvature at the point g( t,x, v 1 , . . . ,v n 1 , l)€- ^R(t,x,£,), 

then K(t,x,v\ „ . . ,v n ^ K(t^, x^, v\...,v n ^). 

i=l 

iv) From the construction, g^ (t,x,v) is analytic in v for fixed (t,x) 
and continuous in (t,x) for fixed v 0 


Combining the results of 


i) and ii) shows that for (t,x)€- 


Jtr 


1 


>?(R(t,x), e/8) C Q(t,x, £)C K(R(t,x), 3fc/4)» 

It will next be shown that using g e (t,x,v) one can construct a mapping 

h € ( t , x , v) on cA x B n *=> E 11 such that if R(t,x,e) = ^h(t,x,v):v£ B n | , 

then R(t,x, £) is a strictly convex, compact set containing R(t,x); 
d(R(t,x, 6) ,R(t,x,)) 6 ; c)R(t,x,£) is smooth with positive Gaussian 

curvature, and if n(t,x,h^ ( t ,x, v\ . . o , v 11 \l) is a unit normal to 
^R(t,x,£) at h € (t,x, v 1 ,o„.,v n “ 1 , 1) then it is a C 1 function of all 


arguments. 

For simplicity of notation let y = (t,x) denote a point in <A- 9 and 
let S k (y-y) be a mollifier function; see [?j . As an example one could 


n+1 

choose S k (y~y) = (k /h'K) 2 exp 


r n+1 


k 

“4 


[I </ - 5 1 )' 

L i-1 


Extend g^ (y,v) as the zero function for y in the complement of Jh 


Define h (y,v) - 


/, 


s k (y-y) g~' (y,v) dy 
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1c 1c • 

Then for every integer k =» 0, h is an analytic function, while h and its 

derivatives with respect to v tend uniformly to g e and its derivatives 

with respect to v. 

Let R k (t,x,e) = jh k (t,x,v) i v £ B n ^ . Since the Gaussian curvature to 

^Q(t,x,€) is given as a multilinear combination of the derivatives 
g € vt (t,x,v 1 , o . o ,v n ~ 1 ,l) while the curvature of R k (t,x,£) is given by the 

same multilinear combination of the derivatives h k f »t,x,v\ . » o ,v n '*',1); one 

\ 1c 

can choose k sufficiently large so that oH (t,x,€.) has positive 
Gaussian curvature while R(t,x)C H (t,x,£)C /l(R(t,x) , . For such a 

choice of k, define h^ (t,x,v) = h k (t,x,v), R(t,x,£) = -^h^ (t,x,v) : v € B n ^ . 

From its construction, h^" satisfies conclusions a), b) and c), while a 

unit normal n(t,x,h e (t,x,v\ . <, . ,v n \l)) to ^R(t,x,£) is a function 
, 1 n~l, 

of (t,X, V ,00=, V ) . 

It remains to show part d). Using lemma III. 3 define r*(t,x,p;£) as 
the unique point on ^ R (t,x, € ) such that n(t,x,r*(t,x,p,€-)) = p/lpl » It 
will be shown that r* is a function of t, x, and p by a proof similar 
to that of theorem III„2o Defining v*(t,x,p) as the unique point on ^ B n 
such that h e ( t , x, v* ( t ,x,p) ) = r*(t,x,p,£) it follows that v* maximizes 
H(t,x,p,v; £) and it will be shown that v* is a G in t, x and p. 

For fixed (t,x), we have 


^ 1 Il“l 

S 11 " 1 Ji— ail*. ^ R ( t , x £ 


,n-l 


which naturally induces a map ©( t ,x, v 3 , . . „ , v 11 from S n ~" L <4 ► S n ■*" de- 

fined by Q(t,x,v\ . „ . ,v n = n(t,x,h e (t,x, v\ . . . , v 11 \l)). Since we are 
only interested in ^ B n = S n \ no confusion should occur if for the re- 
mainder of this argument we let v = (v\...,v n ^) £ S n ^ and therefore write 

0(t,x,v). This will be done. 
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Let y?(p) = p/lpl , p £ E n - ^Oj and define 

Q(t,x,v,^) s 9(t, x, v,) - . We will apply the implicit function 

theorem to Q, which is easily seen to be a C^“ function. For each 
t Q , x q , = P o /|p Q | « there exists a unique point 

r o = r * (t o» V p o ;€) SUCh that if n(t o’ V r o } = p o /lp o' and V o is 

the unique point on S n ^ such that h^ (t ,x ,v ) = r » then 
^ o o o o 

G(t 0 , x o , Y o , <P o ) = 0. One next notes that VVW ^ Q ) = 9 y (t 0 ,x o 

and from the definition of 9 (see also the proof of theorem III. 2) 

det 9 (t , x , v )| is the Gaussian curvature at r €. R (t, x, €_ ) 

L v o o o J o 

which is positive. The implicit function theorem yields the existence 
of a function v(t, x, ) such that G(t, x, v,(t, x, 9^), ^ ) = 0 in 
a neighborhood of the arbitrary point t Q , x q » Then 

r*(t, x, p; £. ) = h £ (t, x, v(t, x, £?(p))) £ C 1 , while 

v*(t, x, p) s v(t, x, <^(p)) is also 



THE RELATION OF TRAJECTORIES OF THE APPROXIMATING PROBLEM TO THOSE 
OF THE TIME OPTIMAL PROBLEM 

We assume the system (3-1) satisfies the Fillipov existence conditions 
( 3 - 3 ) , (3-4), (3-5) and (3-6), with t 1 a time in which the target set S is 
attainable. For any € > 0 let h 6 (t, x, v), v£V(e), be an € approximate 
equivalent problem (not necessarily having the special properties shown to 
exist in theorem III. 4). From condition (3-6) and the relation 
R(t, x,£)ZDR(t, x) , it readily follows that for every £ > 0 there exists 
at least one measurable function v with values in V(£ ) such that the 
corresponding trajectory ^ ( • ; v) of the £ approximate problem 
attains the target S. 

It will next be shown that when dealing with the approximate problem, 
analysis can again be restricted to a compact set. Indeed any vector 
h^ (t, x, v) can be written as f(t, x, u) + o£(t, x) where |o£(t, x)|^- £ . 
Then for any trajectory x(t) of the approximate problem 


I d , ,^.2 


x(t)| 2 = x(t)*n (t,x(t) ,v(t)) = x(t) • f 't,x(t) ;u(t))+x(t)* o£(t,x(t)) 


2 c(i + i x( t ) i ) +eix(t)i 


|r Jtn (1 + |x(t)| 2 )^2 C + 


2&lx(t)| 
1+ lx(t)l 


2 2 (C + 6 ). 


_ 2(c+e)(t.-t ) 

I x( t ) 1 2 2(1 + |x o l 2 )e . 

Define <AT to be the compact region in E dimensional (t,x) space 
so that | x | 2 2(1+ |x q | 2 ) exp ^2(C+£)(2 t^-t Q )] , t Q f t < 2 . 

Theorem III. 5 ° Consider a sequence with 5” 0, ► 0 

and let p denote the time optimal trajectory (assumed to exist) for 

( ) 

the 6 ^ approximate problem. Then \ j is an equicontinous family c 


is an equicontinous family on 
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the interval ^t^, J <> ** has a uniforml y convergent subsequence 

which converges to a function 9 ? having the following properties. 

* 

i) is absolutely continuous 

ii) There exists a measurable function u* with values in U 

such that </>*(t) = f't, £?(t), u*(t)) almost everywhere. 

iii) There exists a smallest t* » t such that t *) €, S 

o 

* 

iv) ^ is a time optimal trajectory for the system (3-D* 


Proof We shall prove the conclusions in the order that they are stated. 

Without loss of generality, assume that R(t,x, €^)Z> R(t,x, € ...3R(t,x). 

1 

Therefore analysis can be restricted to the compact region oO- . Our first 

goal is to show that there is a constant N independent of £ ^ such that 
6 k 

CD is Lipschitz continuous with Lipsehi tz constant N. To accomplish this, 

for a compact set R in E 31 let jo(R) denote max |r| . For fixed 

' r€ R 

R(t, x, € , ) is a continuous set valued function (in the Hausdorff metric 

€ 1 

topology) on the compact set and therefore the composite map 


£> (R(t, x, € j_)) is a continuous real valued functi 


on on 


At' 


hence bounded. 


Let N be its bound « It follows that 


(t, x, v)| < N for all € 


and any trajectory ^ is Lipschitz continuous with Lipschitz constant N« 


Thus 


Irl 


is equicontinuous and has a subsequence which converges 


uniformly to a Lipschitz continuous function 9* , which is therefore 

C € k) 

absolutely continuous. We will not distinguish between \ ? and its 

convergent subsequence. 
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[ * * * 

t Q , t^), ^ (t)). 

Since the set function R(t, x) is continuous in the Hausdorff metric 
topology (a consequence of the continuity of f), for any l) *■ 0 let 
R^l (t, x) be a closed convex i) - neighborhood of R(t, x). Then R^j (t, x) 


is also a continuous set function. 

c. 

Since (t) 6 R(t, (t), €. ) and R(t, x, C ) 


R(t, x) in 


the Hausdorff metric topology, there exists and N such that for all 
n > N, \t) GR^j (t, ^*(t)). Fillipov's proof of theorem 1, j^J 
now applies to show that for almost all t, €. Rj (t, ^*(t)). But 

R(t, x) is closed and l) arbitrarily small, hence ( t) £ R( t ( t) ) for 

almost all t« 

From the lemma of Fillipov |^j , we then obtain the existence of a 
measurable control u* with values in U, such that for almost all 
te [t Q , t x )], C/)*{ t) = f(t, £>*(t), u*(t)). 


iii) Let t 


e k ° 


t denote the optimal time for the £, approximate 


problem. Since R(t, x, 6^)3 R(t, x, it follows that 

(t | is a monotone non-drecreasing sequence of reals bounded above by 

V e 

t 1 . Let t* be its limit. Now 9 s ’ ( ) £- S for each k, and 
1 fc k 

|(t,x) £ S: ^ j i® com P act in E n+ ^, thus ^ ^ € S 

k 


iv) Suppose is n °t a time optimal trajectory for the system 

(3-1). Then there exists a measurable control u with values in U and 

corresponding trajectory * I u) such that ^t 0 i u ) = X Q » 

^(t,; u) £. S and t «= t* . This implies that for k sufficiently large, 

3 j 

t,«= t ; but £?( • | u) is an admissible trajectory to all 6 approximate 

3 e k e k 

problems. This contradicts the optimality of . | 
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This theorem essentially tells us that for sufficiently small £• , 
the optimal trajectories of the €. approximate problem are uniformly 
close to an optimal trajectory of the original problem. 

In the next section the "smoothness” which theorem III. 4 shows is 
possible for the feedback control of the € approximate problem, will 
be exploited to obtain solutions. 

ffgmi lton-Jacobi Theory 

Let the time optimal problem for (3-1) satisfy the Fillipov existence 
conditions. Let x = h £ (t, x, v) denote an £• approximate system with 
the properties a), b) , c) and d), shown to exist in theorem III. 4. For 
the time optimal problem associated with the approximate problem we de- 
fine the functions 

H(t, x, p, v, € ) = p • h € (t, x, v) - 1 
H*(t, x, p, 6 ) = H(t, x, p, v*(t, x, p) , ). 

The inequality 

(3-8) H(t, x, p, v%fc)^H(t, x, p, v, 6 ,) for all v € B n , v / v* 

is a consequence of the definition of v* « 

For the sake of completeness we repeat a short argument of Kalman (^lj , 
pp. 321-322) to show that for fixed € ^ 0, 

H*(t, x, p, e ) = p *hj (t, x, v*(t, x, p)) 

X * 

H*(t, x, p, £ ) 5 h e (t, x, v*(t, x, p)). 

P 

Indeed, we know that v*(t, x, p) €. B B n = S n_1 , thus let g(v) be a smooth 
relation such that g(v) = 0 determines S n 1 in a neighborhood of v*(t, x, p) . 
Then g y (v*(t, x, p) ) v*(t, x, p) = 0 and g y (v*(t, x, p)) v*(t, x, p) = 0. 
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Noting that v* maximizes H(t, x, p, v, £ ) , we consider this maximization 

subject to the constraint v £ S , i.e. , g(v) 5 0. The Lagrange multiplier 

rule implies H y + 1 / g y = 0 where \l / 0 . Evaluting this at v* and multi- 
plying on the right by v*(t, x, p)- and v*(t, x, p) , in turn, gives the 

X p 

required result. 

€ e 

If ^ are solutions, respectively, to the boundary value problem 

(3-9) x = H* ( t , x, p, £ ) = h € (t, x, v* ( t , x, p)) 

(3-10) p = -H*(t, x, p, £ ) = - p • h* (t, x, v* ( t , x, p)) 

A x 

with boundary data x(t Q ) = x q , x(t ][ ) = x , then ( 3 - 8 ) shows that 
£ 6 

v*(t, £^(t), ^ (t)) satisfies the necessary condition termed the maximum 

principle, for being an optimal (open loop) control for the time optimal 
problem of attaining the state x^ from the state x q for the approximating 
system. 

It should be noted that under the conditions assumed, v* £ C^" and the 
initial value problem for the equations ( 3 - 9) 9 ( 3 - 10 ) with data given at t 

o 

will have a unique solution in a neighborhood of t . If v* is discontinuous, 

o ’ 

this presents a serious difficulty in the application of the maximum principle. 

With the (Hamiltonian) function H*(t, x, p, £ ), £ >0 and fixed, we 

associate the Hamilton-Jacobi partial differential equation 

(3-11) V (t,x) + H* ( t, x, V (t, x), £ ) = 0. 

t X 

Let the target S be a ’’smooth" n-dimensional, non-characteristic manifold 
in the (n+1) dimensional (t,x) space, and prescribe the Cauchy data 
V(t,x) = 0, (t,x) 6 So The solution, in the classical sense, of this partial 
differential equation problem, we denote by ; the domain of solution by 

<£n e , s). 
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The characteristic equations associated with (3-H) are the 
equations (3-9), (3-10). If a point (t Q , x q ) is in S) there 

exists a point (t^, x^) € S such that the boundary value problem con- 
sisting of the equations (3-9), (3-10) with boundary data for (3-10) 

being x(t ) = x , x(t, ) = x., , has a solution. The solution of such a 

ooll 

£ £ 

boundary value problem, when it exists, will be denoted by ^ ^ • 

From the continuity condition, for each 6^0, V^(t, ^ (t)) exists 
and satisfies equation (3-10). (See for example ^lj ). Thus we can 
make the association $4* (t) = V (t, ^ (t)). 

Let € , S) denote the set of points (t Q , x q )£ °^"( £ , S) for 

which t « t^; (t^, x^) being the point on S joined to (t Q , x q ) by a 

curve . Assume (t , x )£ ( €, S). If we use the initial data 

o o 

x(t ) = x , p(t ) = V £ (t , x); by virtue of knowing a solution of the 
o o o x o o 

partial differential equation we have the proper initial data to reduce 
the previous two point boundary value problem for (3-9) and (3-10) to an 
initial value problem^, Thus to determine the trajectory ^ we can 
consider the system 

(3-12) X = H*(t," X, V e (t, x); e ), x(t ) = x . 

p X o o 

The major advantage of this method is that now v*- = v*(t, x, (t,x)), 

i.e., a feedback control. 

Theorem III. 6 (Kalman) Assume (t , x )€ <^ ~( £ , S) ; V 6 is the solution 
“ o o 

of the Hamilton-Jacobi equation (3-11) and ^ the solution of (3-12). 
Then is a time optimal trajectory relative to all trajectories 

^( * ; v) which connect (t^, x q ) to S and lie in (6, S) . 
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Proof Assume, without loss of generality, that (t . x ) §£-S. 

Q © 

£ 

From the definition of H, H* and V , 

0 5 V t £ (t, x) + V^(t, x) * h € ( t , x, v*(t, x, V*(t, x))) - l>V*(t, x) + 

V^(t, x) •h 6 (t, x, v) - 1 for all v€B n , v / v* . 

£ 

Assume that t £ (t £ *■ t Q ) is the first time such that (t £ , ^ (t^ ))6 S. 

Let Q denote the set of measurable control functions having values in B n 
and leading to trajectories of the € approximate problem yhich connect 
(t Q , x q ) with a point on S and lie in S) . Then is not empty since 

(t Q , x ) €>'(€, S) and a characteristic implies 

[(t, ^ € (t)): t Q - t - t £ | is in S). If v*(t,/(t), V^(t,^(t)) 

is the only function (to within a set of zero measure) in , the result 

is trivially true* If this is not the case let v = v(t) be any function in 
differing from v*(t, ^ (t) , V^( t, t) ) ) on a set A of positive 
measure o Let * 5 v) be the corresponding solution of the approximate 

system and t^ the first time such that (t^, Cp (t^; v)) £ S. ( t^ > t ) . 

We must show ^ t^° 

Calculating 

v 6 (t, ^(t; v)) - 1 = V t e (t, + V^(t,£?(t;v)) • h^ (t,^?(t;v) ,v(t))- 

for all t and strictly less than zero for t 6 _A_ , implying 

V 6 (t 2 , ^(t 2 ; v)) - V e (t o , x q ) •£ t 2 - t Q . But V £ (t 2 ^ t 2 ; = 0 

since ( t^ » ^(t^; v)) £ s, yielding -V ^ (t Q , x q ) < t 2 - t Q . Similarly 

— V (t, y? (t)) -1 = 0 implying - V(t,x) = t/- - t „ Combining the 
at / o o c o 

last two inequalities gives t g <■ - t^ as was to be shown. | 
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THE CONSTRUCTION OF APPROXIMATING PROBLEMS WHEN THE CONTROL APPEARS LINEARLY. 


Theorem III. 4 gives conditions for the existence of an € equivalent 
approximate problem which has the unit ball B n as the set of values which 
the control can assume. However, the functional form of the approximating 
system is allowed to vary with £ . 

In this section we consider a system of the form 

(3-13) x( t) = g(t, x(t)) + H(t, x(t)) u(t), 

u(t) £ U, a compact convex set in E 1 * with 1 r 2 ® nxr matrix valued 

function; while g is a C^, n vector valued function. For such systems 
it is possible to provide a simple construction for €. approximate problems. 

Since, for the approximate problem, one desires R(t, x, £ ) to be strictly 
convex and lemma III.l shows this implies non void interior , one is led to 
extend H to an nxn matrix valued function and approximate the control set by 
a compact set V(€ ) which contains U. Furthermore, V(€.) should have a non- 
void n dimensional interior, a smooth boundary with positive Gaussian curvature, 
and be such that in the Hausdorff metric topology, lim V(€ ) = U. 

e— -> 0 

The method of construction and the application to approximating problems 
will be demonstrated in a two dimensional example; its generalization to 
higher dimensions being immediate® 

Example III.l (Bushaw control problem). 

Consider the time optimal problem for the system 
« 

( 3 - 14 ) x = x 

1 

o 

X = - X. + u 
2 

with arbitrary initial data x(0) = x^, and target S = £(t, x^ $ i 
x = 0 , x = oj. The control u is to satisfy -1 « u(t) « 1, i.e., U = £-1,1 ] • 
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As an £ approximate problem we take the system 


(3-15) x l ~ x 2 + T i 

X 2 = “ X 1 + V 2 

with the same initial data and target, but with V(£ ) = ^v £ E 2 : € 

i.e., an ellipse with semi major axis 1 and semi minor axis €. . Thus in the 

Hausdorff metric topology lim V(£) = U, and ^R(t, x, £ ) is smooth with 

e-*- 0 

positive Gaussian curvature. From the Hamilton-Jacobi theory 


H(t, x, p, v,€ ) 5 p^ + p^ - p^ + p 2 v 2 - 1. 

Using lemma III. 3 one computes 

v*(t, x, p* = [^ 2 Pi 2 +p 2 2 ] ’ p 2 [^ 2 P l 2 + V 2 1 

from which it follows that 


H*(t, x, p, €. ) = p 1 x 2 



- 1 . 


The associated Hamilton-Jacobi equation is 


(3-16) V (t,x) + x V (t,x) 

t c. X^ 


x V (t,x) 
1 X 2 


+[e 2 v 2 (t,x) + v 2 

L x ± x 2 



Since the independent variables appear linearly, while the dependent 
variable has derivatives which appear non- linearly , the Legendre contact 
transformation is suggested. Let V(t,x) = W(t,p) - p« x. Then 
V =-p, W = x and the transformed equation is 

X p 

% 

w t (t, P ) - w p ^(t, P ) + p 2 (t, P ) + [e 2 p x 2 + p 2 2 ] - 1 = °* 

The characteristic equations associated with this linear partial differential 
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equation are t’(T“) = 1 , p^CZ"") = p 2 (T), p^C 7”) = - yielding 

solutions: t = V + = odsin ( 7~+ (3 ) , p 2 = o£cos (7~+ with 

# arbitrary constants. Then W(t('T’), p(T)) = 1 -^€ 2 p 1 2 (T)+ p 2 2 (T)]^ 


which, after a slight calculation, gives 

a- o 


r K o - t; ) 

W(t,p 1 ,p 2 ; S , "?0 = t-^+S+ / [€ 2 (p 2 sin 7“ + p 1 cosT ) 2 +(p 2 cos?”-p 1 sin^) 2 ] 


dt. 


For a time optimal problem with autonomous system equations and target a 
a point in state space, the constant 5 is inconsequential. We consider 
S = 0 and omit further reference to it. 

By virtue of the transformation, solution trajectories to the system 
(3-15) with v = v*(t, x, p) are given by x(t ^ ) = W^(t,p(t; 
or specifically 


x-^t; ^ ) =j 


(*- t) 

o6e 2 sin(2 T~ +j&)cos?~- oCcos {Z'C+jQ') sinf 

— — jg" 

|g 2 od. 2 sin 2 (27”+^) + od. 2 cos 2 (2T~ yff) ] 


dr- 


* 2 (t; V ) 



o d. € 2 sin(2 T~+/&) sin T~ + odcos (2 cosT* 

^£ 2 o(. 2 sin 2 (2 T* "iy&) + ad. 2 cos 2 (2 7“ ^ _ 


d r 


These formulas can be interpreted as follows. If we choose > 0 and t - 0, 
^x(0; oLi/@ » 0): (od 1 ^ > ) € E 2 j gives the set of initial points x q from 

which the origin can be reached in time by trajectories which satisfy 
(3-15) with v = v*(t, x, p). In particular, it can be shown (via the theory 
of homogeneous contact transformations) that the jacobian determinant 
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is zero, and in this case the set of initial points forms 


^ (x 1 , x^) 

a (oc ,;£) 

a closed curve in for each ^6 > 0. 


To generate a field of extremals (it is to be cautioned that the term 
extremal is to be taken in the sense of the classical calculus of variations; 
i„e„, not necessarily to infer optimality) choose '#’=0 and replace t with -t 
in (3-17). For each choice of one obtains an extremal which is at 

the origin at time zero. Varying © now gives a field of extremals. 
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SUMMARY 


A method is presented for finding an extremal solution of launch trajec- 
tories involving certain inequality path constraints. The method of solution 
is an extension of a technique outlined by Deribow. Deribow has formulated the 
problem considering two or more intersecting admissible arcs; and by a suitable 
transformation of the independent variable, these admissible arcs may be trans- 
formed into a single admissible arc in the problem of Bolza. The transformation 
leads to additional transversality conditions at the intersection of the arcs in 
addition to the usual set of tranversality conditions at the final time. 

The problem considered in this report is the maximization of final weight 
and is solved numerically by the Mayer formulation of the calculus of variations. 
The problem is represented by three arcs - two unconstrained arcs and one con- 
strained arc due to the inequality path constraint. 

The optimum control variables are obtained from the Euler -Lagrange equa- 
tions while the trajectory is moving along the unconstrained arcs. While on 
the constrained arc of the trajectory, the control variables are determined 
from the constraint equation. 
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Section I 


INTRODUCTION 


A method is presented for finding an extremal solution of exit phase 
trajectories involving certain inequality path constraints. The example 
problem considered involves determining the optimum control program to extre- 
malize any desired functional of the coordinates (final vehicle mass). The 
method of solution employed is an extension of the classical method of the 
variational calculus as outlined by Deribow. Denbow has shown that two or more 
admissible arcs can be transformed, by a suitable transformation of the inde- 
pendent variable, into a single admissible arc in the problem of Bolza. The 
problem of concern has three admissible arcs as shown in Figure 1. The results 
of Deribow' s work indicates that at the intersection of these arcs certain trans- 
versality conditions must be satisfied in order to insure an extrema over the 
trajectory. 

The matrix form of the transversality equation is used to determine the 
necessary transversality conditions. In addition to the final transversality 
conditions, certain other transversality conditions must be determined at the 
intermediate points (corners of arcs) along the trajectory. 

The equations of motion are written for a point mass in three-dimensional 
space in an inverse-square gravitational field. The optimum control variables 
are obtained from the Euler -Lagrange equations on the unconstrained arcs of the 
trajectory and from the constraint equation while on the constrained arc of the 
trajectory. 

This report presents the solution to the problem as a result of the 
application of Denbow' s results. 
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Section 2 



MATHEMATICAL FORMULATION 


2.1 Statement of Problem 

The problem may be stated as maximizing the vehicle mass to some given 
set of end conditions through first stage flight. In addition, certain in- 
equality path constaints are imposed on the trajectory. The statement "first 
stage flight" entails the meaning of flight within the sensible atmosphere. 

The particular constraint considered in this report is the product of the angle 
of attack of the vehicle and its dynamic pressure. This type path constraint 
gives an indication of the aerodynamic loads the vehicle will encounter; hence, 
this constraint may be used to control the structural bending. 

The method of solution chosen is an extension of a technique outlined by 
Deribow, Reference 1. This method is essentially the classical calculus of 
variations technique developed by Bliss (Reference 2). 

Denbow has shown that the original problem may be transformed to an equi- 
valent problem of Bolza (the Mayer formulation is used) by a suitable trans- 
formation of the independent variable. This transformation takes the three 
subarcs depicted in Figure 1 for our problem and combines them into a single 
admissible arc. The results of Denbow’ s paper applied to the stated problem 
lead to additional transversality conditions at the variable intermediate 
points "t, and • 

2.2 Equations of Motion 

The equations of motion are written for a space vehicle traveling in an 
inverse -square force field. Three degrees of freedom are used to describe the 
motion about a non-rotating spheroid. The trajectory variables are defined as: 

Vehicle Mass 

Rj Inertial Frame Position v - ' . (2.1) 

\J^ Inertial Frame Velocity v X ( 
and the control variables as: 

% Inertial Roll Attitude of Thrust Vector 

X Inertial Pitch Attitude of Thrust Vector (2.2) 

t" Inertial Yaw Attitude of Thrust Vector 
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FIGURE 





Time, t, is taken as the independent variable. The mass flow rate is constant 
and the thrust vector, T, is assumed to be directed along the longitudinal 
axis of the missile. In addition, roll effects are ignored throughout the 
analysis . 


The powered flight equations are then 

rn_ 


X = 


Ai f &z 


(2.3) 


— a 

where A x , is the non-gravitational acceleration vector in the inertial frame. 
The thrust and drag are given by 

Ar - — CX- C.T 


A t r £• 

r Y rvv 

c X ~ C oc 7 J 5<ric.) 

where 

p” - n - f, 

Ti. - tKvuft at ftliiiuje. Y m - ay IS 

Fyr - cUa-q 4.1°^ a/'S 

and is the gravity vector in the inertial frame. The aerodynamic lift force 
has been neglected in this formulation, The addition of lift would add ex- 
cessive coTTipli cation to the control equations and require supplemental iteration 
to arrive at a solution. 

For consistency in nomenclature 9 the differential equations of motion for 
our problem will be written in the following vector notation. 

~ ><V ~ \ ; ($ ~ ' 7 7 ) (2#u) 

where Y^ is given in equation (2 .3) . 
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and 





is a ^ -vector of known functions of Xtt), Art), and t , assumed everywhere 
differentiable with respect to X and**. 


2.3 Mayer Formulation of Problem 


The functional to be maximized. Equation (2.7) is written directly in 
terms of the boundary conditions. Since the functional does not involve an 
integral (as in the problem of Bolza), the problem can be stated using the 
Mayer formulation of the classical method of the calculus of variations. The 
Mayer formulation of the problem may be written explicitly as : In a class of 

admissible arcs, satisfying the differential Equation (2.1*) 

~ ~ \ (X)'*,*) ~ O 

and end conditions of the form 


as well as the inequality path constraint 


^ = »<T ~ *S 


£ O 




( 2 . 5 ) 


( 2 . 6 ) 


where c< T is the total angle of attack and 3 is defined as the dynamic pressure, 
find the specific arc that minimizes P 


7 - (2.7) 
It may be recalled that the problem of maximizing final vehicle mass is 
identical with that of minimi zing(-V^. 


The problem is shown schematically in Figure 1. The intermediate points 
"L j and t s , vary with each independent trajectory. The entry corner, t, , (point 
where the trajectory goes from unconstrained arc onto a constrained arc) is 
defined to be that time at which the product <=( $ just equals to the maximum 
allowable value. The exit corner (point where the solution goes from the con- 
strained arc onto an unconstrained arc) is denoted by t 2 . The criteria for 
determining this time, t 2 > will be given in the Section on Control Equations. 


By the calculus of variations technique, an extremal solution must satisfy 
the following Euler -Lagrange equations given explicitly as 


> hi + ~i- - o 

- o 


( 2 . 8 ) 
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where 


A(t) - 


3, (0 


hit) 


(2.9) 


— T 


is a vector of Lagrange multiplier functions and ~X (£) is the transpose of 

7 ay 


The above Euler-Lagrange equations were derived explicitly through use of the 
augmented function 

- (^i ~ > * x ) ^ ^2 Vx) ^ ^3 (^ 3 ~ ^ ^4) 

+ ^r(X s -A K ') 4 ■ ^ (x 6 - A Y ^) 4- 2 7 (* 7 - 

2.1* Optimum Control Equations 

The optimum control variables, while on the unconstrained arcs (I and III), 
are found by solving the second set of Euler-Lagrange Equations (2.8) explicitly 
for ^ and X • 

The optimum control variables, while the trajectory is on the unconstrained 
arcs, are then found to be 


M. ~ 


f4 1 


0 

/ - h > 

T 6 



£akj 

r 



£ Akj 

( 7 S J J 

- 





( 2 . 11 ) 


When the vehicle is moving along the constrained arc II, one of the^ 
control variables may be obtained from the constraint Equation (2.6). This 
reduces the degree of optimality by one for flight along the constraining 
arc. (Although strictly arbitrary, it was decided to solve the constraint 
equation directly for ^ ) . 


The pitch command angle, 7 ^ , was solved from the constraint equation as 
follows: 


First, it was necessary to have the constraint equation as an explicit function 
of the control variables. To do this, the constraint equation 


^ AO L h ~ 
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was redefined as 

l\) ~ Sir\ o( T ^ - L* - ® (2.12) 

without loss of generality, since for small allowable ®< T 


S t r c *(f 

Thus, the modified constaint equation is essentially unchanged. While on the 
constraining portion of the trajectory, 

5 L-r\ 

and a fundamental trigometric identity enables us to write 

5 tV, o( T - - l* 

or 

C.OS - \ ~ ^ 

The total angle of attack, Kt , is defined as the angle between the thrust 
and velocity vectors, which may be written vectorially as 

— . - 3 is the vector dot product 

Cos o< T - yj> <M • 

|V| | X„| ) J is the absolute value 

or explicitly in terms of the control variables this becomes 



tO 5 




- Xc xc-e 


WI 


By substituting the above equation into the modified constraint equation (2.12) 
and collecting terms, a quadratic in •< may be obtained. 


2. K t 


X + 1 ^ + (Xct J 


s* 


+ 


K 42 - (Xct + ts*)* ^ 
i* t (Xc-t 4 


- o 


(2.13) 


where 



Hence, the pitch command angle, ^ , may be determined as a function of T on 
the constraint. 
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To solve for the optimum yaw command angle, T , on the constraint, it is 
necessary to modify the second set of Euler -Lagrange Equations (2.8), The 
above Equation (2.13) is solved for % in terms of Z and substituted into these 
Euler -Lagrange equations. When on the constraint, these modified Euler-Lagrange 
equations may be solved explicitly to give 


- / 


a- 


- IX. - t/' U 


4 


\.' fr . ex' -1 ----- 

r'- Lt T at 4 3t 


- 0 (2.1U) 


The two equations (2-13 and 111;) may be solved by a Newton-Raphson iterative 
scheme to obtain the control variables 'f- and Z on the constrained arc. 

As described in the Section on Mayer Formulation of Problem, the entry 
time, t, , is that time at which c* 5 equals the constraint limit, L r . Thus, 
starting at time will be determined from the constraint equation. The 

criteria for determing the exit time, ly , will be established as follows: The. 

exit corner time will be defined as that time when chi solved from the constraint 
equation is equal ,to chi if solved from the Euler-Lagrange equation. Then, on the 
unconstrained arc III % and t are solved from the Euler-Lagrange equations (see 
Equation ( 2 . ll) ) . 


2.5 Transversality Conditions 

A method for determining transversality conditions was suggested in Hunt's 
paper (Reference 3) and involved construction of the matrix shown in Figure 2. 
The matrix form of the transversality conditions for the problem of Mayer was 
formulated as follows 


(a) 


(b) 


Top Row 

at J ~ 0 

for L, , i t , and 

First Column 

(A) Initial Conditions 

(B) Boundary Conditions 


(A) Independent Variable (t c ) 

(B) The Dependent Variables ( X/ ) 
(C> Next (A) and (B) are repeated 


(C) 

(D) 


0 , t /) of Mayer problem 

Special Case - Partial Derivatives of augmented function 
with respect to Xi . 


)Q + J- 

ho*J% 


(c) The partial derivatives of the Q column elements with respect to the 
row elements were calculated to complete the matrix. The blanks 
on Figure 2 indicate zero elements. The transversality conditions 
sire now determined from the augmented matrix by the relationship 
that all determinants of order Q + 1 must be equal to zero. 
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Figure 2 - Matrix Form of Transversality Conditi 


As the entry point, t, f and exit point, , the evaluation of the deter- 
minants results_in five independent transversality conditions relating the 
changes in the \s (denoted by A \ ) across the constraint boundaries. For 
the boundary points t, and t* » the TVC's are 

Y* kXj - ^y z O 
z O 

^ L V 5 - ^ A \ z - o ( 2 .l£) 

^ ~ ^ r O 

- %h \-o 

The Lagrange multiplier \ , associated with the mass_doesn't enter into 
the equations and was omitted from the analysis. Another \ may be set equal 
to one due to homogeneity of Euler-Lagrange equation. This leaves five X s 
( \i t 0 \n ) to be determined. Investigating the intermediate points, we find 
that an additional transversality equation is available at these boundary 
points, and • 


From the augmented matrix, the following relations must hold: 


■)v r O 
A) -- ° 


’\j ) : Sir, ~ 0 


Since the A\ s are not zero, these equations stipulate that 



n 


From the preceding section, it was seen that the constraint equation may 
be written as: . 4 

vy - Xcxer i- t cxs? - i sx - K - ° 


where 


K* =f\ 




} 


v 


(2.16) 


This equations yields 

! 4 ] X r -Xsxci - Xsx si - Icy, - ° 

~ X c X S Y ^ 

Simple manipulations of the optimum control Equations (2.11) give the following 
relations: 

X . X 

So< T ' t i Cos t- 


Therefore, 


y; >* i \ l 


T >\* x 



and 


st^ X - 




XI 


where 


Cos X = ~L& + 

fxi 


\l - \i + \1 * Vi 


This gives 


Sl-nxSi-nc = - 

'Al VTV + 


1 Si. 10 y cos y ~ - 


Cot yeast z 


}j_ 

W 


) Cos X $<:>■* 


\ 5 


\v\\Ty s + 


_V 

w 


Substituting the necessary relations into Equation (2.16) and solving simul- 
taneously, we obtain the following transversality condition valid on the 
unconstrained side of points t,, and t 2 . 

E 

X 


\q - V 


(2.17) 


It is interesting to note that this equation enables us to determine 
aX 5 or A X 7 at point t 2 from which the remaining A 1 s may be determined from 
Equation (2.l5). Thus, all the TVC 1 s at point t 2 are trivially satisfied. 


However, at the entry point, t,, the above TVC (Equation 2.17) is valid 
on the left unconstrained side of the point. Therefore, we must isolate on one 
of the A V 3 at point t, , from which the remaining h\'s may be determined from 
Equation (2.15). Equation (2.17) then remains as a transversality equation that 
must be satisfied at the entry point, t, . 


A specified .altitude and velocity was selected as end conditions for this 
particular example problem. Velocity will be the "cutoff" criterion leaving 
altitude as a transversality condition to be satisfied at the final time, i 3 . 
This set of end conditions gave simplified final TVC's focusing attention to 
the intermediate TVC's at t, and t 7 . The remaining final TVC's are obtained 
from the Transversality Matrix. The final TVC's are given in the following 


equations . 


TVC C l ) - 
TVCU) = 
TVC (3) - 
TVC 

TVC (5) = 


X^X. - 

x 4 x - 


> ts > P 

Xi Y ~ 


- o 


\i i 

= - > 5 y 


z O 

^ o 

z o 


(2.18) 


Xq Y - \<c «? 

Summarizing the results above, we find that there are five transversality 
conditions to be satisfied at the final time, t 3 , and one remaining transver- 
sality condition to be satisfied at the entry corner, t, . Thus it is necessary 
to have six adjoint variables that we can vary in order to satisfy these six 
transversality conditions. 


The total number of initial adjoint variables available at to is seven. 
However, one of these, namely X, , is associated with the vehicle mass and does 
not appear in the transversality conditions and is omitted. One other 
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may be set equal to one due to the homogeneity of the Euler -Lagrange equations. 
This gives a total of five initial 1 5 that must be chosen. The sixth adjoint 
variable free to choose is one of the &X S across the entry corner, t, . 

A systematic search routine is employed to determine the values of the 
five initial \'s and the &Vs across the exit point in order to satisfy the 
transversality conditions. By satisfying these transversality conditions, an 
optimum trajectory for the entire first stage flight results. 
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Appendix A 
REFERENCE FRAMES 


A description of the four coordinate frames used in the analysis is stated 
below. The relationships among these reference frames are shown on Figure 1 A. 

It may be noted that all reference frames are right-hand coordinate systems. 

1 . R-Frame This Reference-frame has its X R -axis through the prime meri- 
dian at the time of launch and its Zr -axis through the North Pole. The Yr -axis 
is oriented to form a right-hand coordinate system. 

2. I-Frame This is an Inertial-frame centered at the launch site with 
the Xj -axis in the negative direction of the gravity vector, G, and Zy -axis 
at a given azimuth from North. The orientation of the I-frame is found from 
the R-frame by the following sequence of rotation: 

(a) Rotation about the Zr - axis by the longitude of the launch 
point, 

(b) Rotation about the new Yr - axis in the negative direction by 
the plumbline latitude, - 

(c) Rotation about the new Jr - axis in the negative direction by 
the firing azimuth, -Azl$ 

The azimuth is measured in the plane normal to the local geodetic or plumb- 
line direction. 

3 . L -Frame This is the Local Horizontal ( octangent) -frame which is 
normal to the local geodetic or plumbline direction. The L-frame is transformed 
from the R-frame by the following rotations: 

(a) Rotation about the Z R - axis by the instantaneous longitude, ^ . 

(b) Rotation about the new Yr - axis by (- '/■' " , where ^ is the 

instantaneous latitude. 

I 4 ,. M-Frame The origin of the Missile-frame is at the center of gravity 
of the missile with the % - axis forward along the longitudinal axis of the 
missile. The negative Z M - axis is in the direction of aerodynamic lift. The 
transformation is obtained from the I-frame by the Euler angle sequence of 
rotation: 

(a) Rotation about the Xj - axis by the inner gimbal angle, . 

(b) Rotation about the new Z-j- - axis by the middle gimbal angle, C . 

(c) Rotation about the new Y" - axis by the outer gimbal angle, pt. 
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The three gimbal angles ^ , and % are the command attitude angles. 

They describe the orientation of the thrust vector in the I-frame. 

The transformation matrices from one frame to another are given below. 
These transformations are formed by successive rotations of the Euler angles 
described in the preceding paragraphs. The transformation from R-frame to 
I-frame, Tr 2 I, is time invariant (remains fixed from launch). 

- Cfl iu o Sv»c.C<|>.-SF*j U o s k’ - Cflju, Sl ^ +SPi jv.o''4><. 


T. 


m 


C « 4U) C vpo 


S 4 >o 

'^^W> C 'Vo 


C. N ct Po 


(A.l) 


T 


*l?2 L 


- S C $ 
- S 4> 

- ctydp 


- 

c 4) 

C(p 


-C'V s$ 

- s ^ 


(A. 2; 


Tj Z m 


c 7 C -t 

- 


c X s <•/ o ? 

Ct C 5 

sx stef - cxs$ 


r _ 1 1 


| 1 

r 1 

r "1 

i T : _ 

1 L2K 

i» J 

Ttzm 

l* J 

I 

* I 

T 

iK2lj 

. T 

■ L2 R 


CX St - sx 

n . ,* V- 

^ i >5 

ix :•/• c 5 


The inverse (transpose) of each matrix is 


T B2 


Bz R 


It 1 

i H2B J 


-I 


1 T 


! n*B 


since all of the transformations are orthogonal. 


(A. 3) 


(A.li) 
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SPHERICAL RELATIONS 
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ABSTRACT 


A three dimensional derivation is presented of the 
equations and boundary conditions necessary to deter- 
mine the minimum fuel orbit transfer path by optimizing 
the thrust direction and duration. The formulation, 
known as the Mayer problem in the calculus of variations, 
yields a two point boundary value problem. A Newton- 
Raphson method was used to attempt convergence of this 
two point boundary value problem, but it was found to 
be inadequate. However, with the final orbit unspecified 
numerous solutions satisfying the Mayer formulation 
were generated and then compared with the optimum 
two-impulse transfer between the same two orbits. 

This comparison is quite revealing; it shows first, 
that for the restricted class of orbits examined the 
optimum two-impulse estimate of velocity increment, 
or fuel required is very good. Second, it demonstrates 
that although the optimum departure and arrival points 
obtained from the impulsive and finite thrust solutions 
may be quite different, the penalty in using the former 
for design estimates may be quite minor. 
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INTRODUCTION 


In this report we are concerned with the problem of moving a vehicle 
between two arbitrary orbits in space. The orbits are assumed to have one 
planet as a common focus which generates a uniform central gravitational 
field, and the vehicle is assumed to be capable of thrust direction and on-off 
control. We present a complete derivation, in three dimensions, of the 
equations and boundary conditions necessary to determine the minimum-fuel 
orbit transfer path by optimizing the thrust direction and duration, and the 
departure and arrival points on the initial and final orbits. The Mayer 
formulation of the calculus of variations is used. 

We turn to optimization procedures for finding the transfer path for 
three reasons: First, the problem of realistic minimum fuel requirements 

for space maneuvers is one of extreme importance. Second, for the 
purposes of design studies based on impulsive transfer, it is necessary to 
know the error made by the assumption of impulses. Third, the optimiza- 
tion technique gives an organized and general way for finding a transfer 
path; it is a procedure that is of significance no matter what quantity is to 
be extremized, since it provides a suitable steering program to accomplish 
the desired mission. 

Selection of the optimization technique is primarily decided by what 
has been reported in the literature, and the experience of the investigator. 
Either the indirect method-use of Lagrange multipliers -or the direct 
method-steepest descent-can be used. Reference (2) reports a successful 
application of the Mayer formulation to the problem of boosting the 
maximum payload into orbit with a high thrust engine. Reference (3) also 
uses the same method successfully on the problem of coplanar orbital 
transfer with very low thrust engines. Both applications utilized the 
Newton-Raphson method as the principal iterative technique for solving 
the two-point boundary value problem. These reports were the main 
factors in this selection and in the initial approach to the two-point boundary 
value problem used in this study. 
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I. EQUATIONS OF MOTION* 


The kinetic energy per unit mass is: 

• Z 2-2 Z 2-2 
P = 1/Z (r + r 0 + r cos 0 0); see Fig. 1 

The potential energy per unit mass is: 


V = — (p = K M ) 
r ^ earth 7 


The Lagrangian, L. = P - V: 

L = 1/Z ( r ^ + r^0^ + r^cos^0 0^) + — 

r 

The three second-order equations of motion are obtained from: 

_d_ 1L _0L _ . __ , . 

dt 3 q. " 9 q. Q i* 1 1_ * 

1 1 


where the q^ are r, 0, and 0. The are the generalized force and 
moments due to the thrust, T: 


Q = — cos ib cos v 
r m 


= r sin \\) 

0 m 


Q = r cos ib sin v cos 0 

0 m 


Thus, the three second-order equations of motion are: 

*• . Z 2-2 UL T 

r - r 0 - r cos 0 0 + — = cos di cos v 

Z m 
r 


d , 2 • Z . -2 T 

— (r 0) + r cos 0 sin 0 0 = r sin ib 

dt m 


*See also references 4-6. 
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( 3 ) 


d , 2 2 •. T 

— (r cos 0 0) = — r cos dj sin v cos 0 
dt ' Y m 

We want the thrust, T, to be either on or off. Hence, we define T = c (3, 
where c = an effective exhaust velocity, and }3 = mass flow rate. 




ML 1 l L 

M 

Check dimensions: 

T 

F 2 ’ c(3 T 

T 

• 

T 


7T 

Expanding (2) and (3), and noting that we cannot have 0 = ± y , we get the 

following seven first-order equations of motion, where new variables p , x, 
y are defined as indicated: 




w i 

o 

II 

Q_ 

l 

• U 

lit 

(4) 



W 2 

= 0 - X = 0 

(5) 



W 3 

o 

11 

1 

III 

(6) 

• Q_ 
III 

£ 

Z 

- r x 

2 2 ul c (3 , n 

- r cos 0 y + -*E - - 1 - cos i)j cos v = 0 
Z m 

r 

(7) 

W 5 

Z 

E x + — 

p x + 
r 

Z ci3 

cos 0 sin 0 y - sin iJj = 0 

7 mr 

(8) 

w 6 E y 

- Z tan 

0 • 

Z p y c(3 cos ijj sin v _ Q 

x ^ r mr cos 0 

(9) 



w 7 

m 

3 • 

II 

o 

(10) 


The optimum path (for min. fuel expenditure) that is to be found must 
satisfy the equations of motion, and this is represented by constraints, 
w^ = o , i = 1 -7 . 

There is one further constraint to be added: We require the thrust to 

be on or off- -no throttling. This is expressed by: 

*8 - P m ax.> = ° 
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Hence, problem variables are: 


Dependent 


Dynamic and kinematic 


Control 


r 

0 

0 

m 


P 

x 

y 


V 

f3 


Independent 


t 


Denoting all dependent variables by z, the constraints can be expressed as 


w. = z. - f. (z .) = 0 i = 1 ~8, 
i iij 


1 -10 


II. DERIVATION OF OPTIMIZATION PROBLEM * 


A. Since the quantity that we want to minimize only enters in the 
boundary conditions (we use the Mayer formulation of the calculus of 
variations), let us first obtain the Euler-Lagrange equations associated with 
the control variables ijj, v, (3 . 

F = X i (t) w i (z^ Zj ) 


Require: 


d 9 F 


9 F 


dt 9 z. 9 z. 
1 x 


= 0 z. = v, ijj, (3 


9 F * 

a” 1 — = 0 •• -r— 

o z . o z 


9 X . w. 

L . J 


9 w . 


= X 


j 9 z 


1 = 0 


1 . ) z = 


= V 


X (^- cos ib sin v ) + X ( 

4 \ m ' oy 


- eft cos ib cos v 
mr cos 0 ; 


= 0 


cS I COS l)j COS V 

— — X cos ib sin v - X ~ 

m l 4 y 6 r cos 0 


= 0 


If (3 = 0, then T = 0 and ip and v have no meaning, and we simply compute 
the X^t) by a closed-form solution which is given in Appendix A. For 
(3 ^ 0, and c and m * 0 for all t: 


cos ijj | X sin v 


X cos v 

■ 0 

X 6 

/.Either i|> = ± — , or tan v = 


X . r cos 0 
4 


(in 


X 


.. sin V = ± r ^ , cos V = 


X . r cos 0 
4 

± D 


(12), (13) 


*See also references (7)-(9). 
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where 


D = 

ii * 


x 2 x 2 2 2 
X z + X . r cos 0 
v w o 4 


Note: From equations (7) and (9), that if ip = ± tt/ 2, the v terms drop out, 

as expected on physical grounds. 


2. ) z = ip 

X 4 sin 't' cos v | + X 5 |‘ 


mr 


c p , cp sin 4 j sin vl 

cos ib + \ , — 1 1 = o 

J 6 mr cos 0 


If (3 = 0, then the argument is the same as above. For [3^0: 
sin ijj 


X sin v 

r X , cos v + — 

4 cos 0 


- X cos 4^—0 
5 


Insert (12) and (13) for sin v and cos v, and collect terms: 

± D 


sin l|j 


cos 0 


- X 5 cos 4 j = 0 


X cos 0 

sm \\) 5 

= tan = — 

cos ^ ± D 


(14) 


3.) z = (3 


- c 


4 m 


COS cos v + X 


f — sin i(j| 

[ mr J 


+ X 


- c cos \]) sin v 
mr cos 0 


+ x 7 <i> + x 8 |« 3 -'wH -° 


(15) 


This equation yields X g, but it is of no significance in this problem. 

B. To reconcile the sign ambiguities in 1 . ) and 2. ), above, and to 

determine when 3 = 0, [3 = (3 , we turn to the Weierstrass necessary 

r max. 

condition. 
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1.) This condition states that for a minimum, E ^ 0: 


• ❖ 


E = F (Z'" , Z‘ ) - F (Z., Z.) 


(Z. - Z.) 


8 F 
i ; d±. 


Z? differs from by a finite , but admissible amount. 

The only variables which admit of such strong variation are v, p, and 
p, where, for example: 


ijj = p or \\i + it; v = v or v + tt; (3 = 0 or |3 


max. 


Now, the third term in E is identically zero since there are no constraints 

• • * 

involving v, (3. 

E = X.(t) w. (zj , zT) - X.(t) w. (Z., Z.) 0 j, i - 1-8 

E =K.(t) [z* - f. (Z*)] - x.<t) |z. - r(Z.)| 5 0 


E = X .(t) f. (Z.) - X .(t) f. (Z . ) > 0 

1 1 J 1 1 J 


or 


X.(t)f.(Z) ^ X.(t)f.(Z. ) 


Applying (16) we get: 
t c S 

X — cos di cos v 
4 l m 


r c8 1 x f cp cos 4j sin v 

+ X I — - sin di + X 5 

5 l mr T J 6 l mr cos 0 

-A. 

+ X 7 (-(J) + X (-pip - pi) * K 4 l^|- COS ^ C ° S 


(16) 


A A 


X , c P . . 

+ X „ | — — sin up 


5 l mr 


+ X 


c (3 cos 

sin v 

mr cos 

e 


+ x 7 ( - p' 1 ') 


❖ 


+ x - p (P - p ) 

o max 


Note, first, that the X g term = 0. 
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Now, factoring out a and ^ yields, in the notation of ref. (8): 

£ # 

(3 k - (3 k ^ 0 


X cos 4* sin v, 

6 -)- s 


where 

c / 5 

k = — (\ . cos ib cos v + sin ib + _ 

m \ 4 r r cos 0 

For k = k*, p * p*; k (p - p*) > 0 

If k > 0, then p > p* =$> P = P 

max 

If k < 0, then (3 < (3*=>(3 = 0 

Thus, we have the engine on-off criteria. 

For (3 = k ^ k ; 


X 


5 . 


X , cos 4 sin v 
o 


(17a) 

(17b) 


X . cos dj cos v + — 11 sin i b + — 1 — > k A cos i p cos v 

4 T r r cos 0 4 


^ 5 * 

+ sin ib + 

r 


X cos 4 / 

o 

r cos 0 


sm v 


(18) 


a. ) 4 = 4 ' j v ^ v 

Hence, (18) becomes 


v ~ v 


or V + TT (= v ) 


X cos ib sin v 
0 


X , cos ib cos v + 

4 r cos 0 


^ 0 


Using (1 2) and (13): 


COS 4 


1 2 2 Z a . V 2 

A ^ r cos 0 I X. 

4 6_ 

± D r cos 0 
v 


> 0 
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or. 


cos 41 


r ± d 

v 

r cos 0 


> 0 


Since r > 0, and— y- < 0 < > the above yields 


+ D , cos ib > 0 
v 


D , cos ib C 0 
v 


(19) 


Physically, we will most probably be confined to 


t < + < i=^ + D , 


b.) v — v ; ijj * ^ =^> 4 j =4 ,or 4 , + ' n '( = 4 j ) 
From (14) and (19): 


X cos 9 
tan cp = — ^3 


X _ cos 0 ±D 

5 . v 

sin \\> = , cos ip - 


±D 


±D 






( 20 ) 


where 


D 


4^ 




, ,2 2 * 
+ Xg COS V 


F rom (18) again: 


X cos ib cos y + 

4 r 


X sin ib X cos i(j sin v 

-5 + _6 > o 


r cos 0 
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Substituting (12), (13), and (20) and clearing yields: 


± D 




r cos 0 


> 0 


Again, since r cos 0 > 0, this requires + D 


(21) 


C. There is a first integral, since the Lagrangian, F, does not 
involve time explicitly. 


»F . _ » V(t) w. (i,, z ) 

W z k = C; 


0 z. 


*k =C 


0 (z - f. (Z.)) 

x i (t) FT" 4 k = c 

k 


Hence, 


X r + X 2 0 + X 3 p + X 4 p + X ^x + X ^y + X ^m = C (22) 


D. Boundary Conditions 


The boundary conditions to be applied come from two sources: Those 

implied by the physics of the problem, and the remainder from the 
transver sality condition 


d G + (F - 


3 F 



z ) dt + 

K 


0 F 



T 

= 0 , 

0 


(23) 


where G is the function to be minimized. 

1 . ) To clarify the derivation of the boundary conditions, let us first 
consider that the two orbits are coplanar. We reiterate the problem: Find 

the minimum fuel path to transfer between two coplanar orbits by optimizing 
the thrust direction (v) and duration ("Bang-bang" control). The departure 
and arrival points on the initial and final orbits are not specified, but the 
total time of transfer is specified. The geometry is shown in Figure 2. 


153 


* 
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FIG. 2. TRANSFER GEOMETRY 
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Thus, we have a system of 10 first-order differential equations for 
the variables: 


r, <P, p. y, m, \ 3> X 4> X 6> X ? 

This system thus requires 10 boundary conditions. The seven specified by the 
physics of the problem are: (i = initial, f = final). 

p. (or h.), e. (or E.), co. , m. 

*1 ' i i l i i 

p f (or h f ), e f (or E f ), w f (24) 

p, e, go are semi-latus rectum, eccentricity, and argument of perigee, 
respectively, h and E are angular momentum and total energy. 

We derive the three remaining conditions from Equation (23) and thus 
we are obliged finally to select the quantity to be optimized. Since we wish 
to compare our results with minimum impulsive orbital transfer, let us 
consider minimizing the characteristic velocity, 

m. 

G = c In — 


Equation (23) becomes, utilizing (22); 


-f— 5- + X„1 dm + X 1 dm. + - C dt + X dr 

m. 7 f m. 71 l 1 


+ X d<f> + X dp + X dy = 0 


(25) 


Since m^ is specified, drrq = 0. Also, dt] - 0, which implies C - 
unknow n . T hu s , 

X _ = — at t = T 
7 m 

This is our eighth boundary condition. The remaining two come from 
£ X j dr + X ^ d^ + X^ dp + X ^ dy j =0, 


(26) 


(27) 
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where we use orbit equations to relate the differentials in terms of the given 
parameters p, e, and w. To do this we note: 


r =•- ^-7- r = f( 0) (28) 

1 + e cos (0 - oo) r v ' 

dr = f 1 ( 0) d0 

E/m = l/2(r 2 + r 2 0 2 ) - ill (29) 


d(E/ m) 


rdr + 0 rdr + 


0d 0 



0 


h/m 


Z . 
r 0 


d(h / m) 


Zr 0dr + 


r 2 d 


<A = 


0 


(30) 


Expressing all the differentials in terms of d0, the two boundary conditions 
then are 


f'(0) 


2 V 


. 4 . • Z li 

X 1 -I -1 ' r * + "^ ) - r 

r 


+ \ = 0, at t 


0, T. 


These two equations can be put in a more revealing form. Substituting p and 
y from the equations of motion, we find 


fx i + jx 3 + p-x 4 + f x 6 = 


X , sin v 

6 


+ X . cos v 
4 


at t = 0, T 


Utilizing Equation 17 from p. 9, with ip - 0 and X ^ = 0, we see that the 
right side of the above equation is 


Pk + P X ? , or 


r X . + 0 X + p X + y X + m X - pk at t = 0 , T (31) 

1 3 tc 6 7 

This thus identifies the constant, C (Equation (ZZ))as equal to pk at the 
end points. 


Further, if C ^ 0 at t = 0, (31) implies that k(0) = k(T). 
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2. ) We can now proceed to derive, rather succintly, the boundary 
conditions for the three dimensional case. The problem requires fourteen 
boundary conditions since there are fourteen first order differential 
conditions for the variables: 

r, 6 , <f> , p , x, y, m, X^, \ \ 

The physics of the problem now yields eleven conditions while the trans- 
versality (Equation (23)) yields three, exactly as in the planar case. The 
additional four physical constraints are that the vehicle's position and 
velocity are to be in the specified initial and final planes. 

We list the fourteen conditions in terms of their origin: 

(a) From the final point (t = T), there are five: By choosing the final 
plane to have zero inclination the two additional constraints at the final point 
are simply 0(T) = 0 and 0(T) = 0. The other three are Equations (28), 

(29) and (30) applied to the final point. 

(b) From the initial point (t = 0), there are six: One of the six is the 

specification of initial mass, while five are orbit equations. The initial 
orbital plane is taken to have an inclination i and to have its ascending node 
on the xj axis as in Figure 2. The departure point angle called <f>j in 
Figure 2 is replaced so that <j> represents the angle in the Xj, x 2 plane as 
in Figure 1. The five orbital equations may be taken as: Equations (28), 
(29), (30), 


sin 0 = tan 6 cot i, 


(33) 


and 


2 2 h 

yr cos 0 = — cos l. 

3 m 

(c) From the transversality condition, there are three: 

\_ = — at t = T 
7 m 

is obtained exactly as before. The remaining two equations are: 

| ^ dr + ^ 2 d 0 + ^3 d<t> + dp + ^ 5 dx + ^ dy j =0 


and 


| dr + ^3 d$ + ^4 dp + ^6 dy j * ^ = 0 


(34) 


(35) 


(36) 


(37) 


157 


In addition it should be pointed out that just as in the planar case 
Equation (36) and Equation (37) are equivalent to 

(3(0) k (0) = C 

(3 (T) k (T) = C 


(38) 

(39) 


Finally, for use in computation it must be indicated that equation (36) 
along with the total differentials of the five orbit equations (28, 29, 30, 33, 
and 34) constitute a set of six homogenous equations, the determinant of 
whose coefficients is the required relationship. This is the generalization of 
Equation (31) for the initial point. For the final point the generalization is 
the same as in the planar problem. 


E. Corner Conditions 


The points at which the thrust goes on or off give rise to dis- 
continuities in the z^. The mathematical criterion needed to join 
different positions of the extremal arc is supplied by the Erdmann- 
Weierstrass corner condition: 



(40) 


( 41 ) 
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We observe that any of the seven conditions which comprise (40) would 
not apply if the value of the physical variable were specified at the dis- 
continuity. Similarly, (41) would not apply if the time of the discontinuity 
were specified. 

F. Euler -Lagrange Equations 

Here we write down the differential equations for the Lagrange 
multipliers, which come from the Euler necessary condition in the calculus 
of variations: 


d 

dt 


9 F 


9 z. 


9 F 

= °; z k = r, 0, <f>, p , x, y, m 


(42) 


F = \.w. = X.. (t) fz. - f. (z )1 
J J J l J J i J 


j / x 

Tt 6 jJ = - ‘V 


9 f. 

\ - - x j alf- ‘V 


(43) 


Using equations (4) - (10), equation (43) yields: 


• \ 2 2 2 2ix i 5 r c(3 sin ib 

\ - - h * + y cos 6 + -f - — 2 » x - 

r J r 


6 

2 


2 p y - 


c (3 cos sin v 
m cos 8 


(44) 


* 2 2 
\ - \ (- 2ry cos 0 sin 0) + \ y cos 2 0 

l ~r J 


- X, [2 x v sec^0 + -^-2- cos ib sin v tan 0 sec 0 I 
6 1 mr I 


S a 0 


(45) 

(46) 
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III. ITERATIVE METHOD 


The equations (4) - (10) and (44) - (50), plus the control equations for 
the switching function, k, and the steering angles, i \t and v, are a set of 
differential and algebraic equations whose boundary values at t = 0 and 
t = T must meet the specified conditions at those two points. We are thus 
faced with the well-known two-point boundary value problem. The Newton - 
Raphson method, and a "Matrix Modification" technique were selected as 
the first iterative techniques to attempt convergence of the two-point 
boundary value problem. Both these methods are fully explained in 
reference (2), and only a brief description of the convergence characteristics 
of this method on this problem will be given here. 

The iterative techniques have so far been only applied to the coplanar 
case because it was felt that until a fast and reliable method was available 
for that problem it was rather hopeless to tackle the three-dimensional 
case. Reference (3) reported success with this technique for low-thrust 
engines, but in this case when the thrust- to-weight ratio (T/W) is between 
one and ten, it does not seem to be able to handle the problem. One 
comment about a T/W of ten is in order; the iterative procedure begins 
by. first obtaining the optimum two-impulse transfer. We then have the 
optimum departure and arrival points, velocity increment necessary, time 
for the transfer, and initial and final thrust direction. Hence, if we 
assume an engine with a T/W = 10, we have almost an impulsive vehicle, 
and if the final time is set equal to the impulsive time for transfer plus 
the time necessary to burn fuel yielding a velocity increment equal to or 
slightly greater than the impulsive solution, we can expect that the 
finite -thrust solution will be very close to the impulsive solution in all 
respects. Once this one has been obtained, we can then proceed to 
decrease the T/W to 8, 6, 4, etc., obtaining solutions for all these, until 
we are down to precisely the engine in which we are interested. 

Now, the Newton-Raphson method applied to the coplanar problem 
has the behavior of converging on the transver sality condition first, 
equation (31), and then keeping that satisfied, move very slowly towards 
meeting the orbit conditions, p, e, and go. The conclusion, so far, is 
that the method is inadequate for this complex and sensitive problem. 
However, several modifications of the method, and its use, are being 
studied, and it may yet prove capable. If not, other iterative methods for 
handling the two-point boundary value problem are being studied, and will 
be tried if the Newton-Raphson proves conclusively unsatisfactory. 
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NUMERICAL RESULTS 


In the introduction to this paper, three reasons for turning to optimi- 
zation procedures for the solution of the minimum fuel orbital transfer 
problem were given. This section gives an indication of the answer to the 
second statement; i.e., the comparison with two-impulse orbital transfer. 
The answer is not conclusive since the switching function time history was 
restricted to one coast period, and the second burn period was terminated 
as soon as 


k (t) = k (0) ; see equation (31). 

Thus, a rather restricted class of initial and final orbits was considered; 
all orbit pairs intersected, and in'most cases the intersection was quite 
shallow. 


The following table presents some of the results gathered from this 
restricted comparison. The first column is the thrust-to-weight ratio at 
the initial orbit; for example, a vehicle of 1000 slugs mass, with fuel-flow 
rate, j8 , of 1 slug/ sec. , has a specific impulse of 300 sec. if the 
(T/W)^ = . 7118, at a distance of 6058 miles from the center of the earth. 

In the second column, the percentage difference in velocity increment is 
given; Vp = c In m i , and Vj is equal to the total velocity increment 

from the two-impulse minimization. Total A <f> y in the third column was 
computed as follows: 


Total A <l> = 


* 


i> F 



+ 


^f»F ~ ^f» I 


Thus it represents the total deviation in the departure and arrival points 
between this finite thrust solution — subscript F — and the impulsive solution — 
subscript I. The last column gives an approximation to the penalty in 
velocity increment, or fuel, if the departure and arrival point of the 
impulsive solution is used instead of the points specified by the finite thrust 
solution. This estimate was obtained in the following way: Reference (1) 

presents contour maps of minimum transfer velocity on a <f> plot. By 

differencing the value at {<f> i, j, <£f, i) with the value at (<£|, p, <f> f, p), and 
dividing by Vj, we obtain an estimate of the penalty in velocity, or fuel, that 
would be incurred. We emphasize that this is an approximation; but in view 
of the results in the second column, it is probably a reasonable one. 
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Finite Thrust Versus Two Impulse Comparison 


(T/W)i 

<VF - Vl) 102 
Vi 

Total A <f>, deg. 

1 0^ Penalty 
Vi 

10 

.086 

20.0 

.135 

10 

.136 

26.3 

.410 

8 

.203 

21.1 

. 352 

8 

.236 

27.9 

.401 

6 

.143 

18.7 

. 365 

6 

.501 

29.8 

. 685 

4 

.278 

34.4 

. 874 

4 

. 354 

32.2 

.247 

2 

.224 

24.8 

. 611 

2 

.293 

28.7 

.631 

. 7118 

.095 

72.8 

1.89 

. 7118 

.194 

13.0 

.407 


We observe from the first and second columns, that if orbit transfers 
with realistic vehicles are restricted to be completed in one orbit, then the 
time constraint — obtained from the impulsive solution — placed upon these 
finite thrust solutions is also realistic, and, ipso facto, the fuel requirement 
for the transfer obtained from the two-impulse solution is a very good 
estimate of that which would actually be needed. This is, of course, with 
the assumption that the finite thrust transfer vehicle departs and arrives at 
the proper point, for we see that the discrepancies in if> £ and <f> f can be quite 
sizable. However, from the fourth column, we note that the penalty in fuel, 
or velocity, for using the optimum <f>£, <f > f from the impulsive solution rather 
than those specified by the finite thrust solution may be quite minor; however, 
this was a rather restricted comparison, and a good deal more numerical 
results are necessary before any even tentative generalizations in this 
direction are possible. 


163 


CONCLUDING REMARKS 


The Mayer formulation of the calculus of variations has been used to 
derive, in three dimensions, the equations and boundary conditions necessary 
to determine the minimum fuel orbit transfer path by optimizing the thrust 
direction and duration, and the departure and arrival points on the initial 
and final orbits. The closed-form solution to the Euler-Lagrange equations, 
which apply along the coast arc has also been derived, rather explicitly, and 
has been verified by some of the numerical integrations indicated in the 
preceding section. 

The numerical results section is considerably leaner than desired. 

One conclusion, therefore, is that the multivariable Newton-Raphson 
iteration technique is inadequate for this complex and sensitive problem. 

This is a useful, albeit frustrating result. A more gratifying result is the 
favorable comparison of two -impulse and finite thrust orbit transfer 
solutions. Restrictive as it is, it should be of interest to design personnel, 
for it is the first proven indication, to this writer’s knowledge, of the real 
utility of the impulsive solution and how much a design based on it differs 
from the optimum. 

It is hoped, and rather optimistically felt, that one of the iteration 
techniques currently under study for solving the two -point boundary value 
problem will be effective in this endeavor. With this accomplished, an 
unrestricted variety of problems with an equally unrestricted genus of 
propulsion systems will be able to be expediently solved. The two -impulse 
solution is obviously not universally a good estimate for design, or even 
applicable. When low-thrust ion or nuclear propulsion systems are being 
considered, and interplanetary transfers are being studied, it will be 
distinctly advantageous, if not imperative, that the capability begun herein 
be a reality. 
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APPENDIX A 


SOLUTION TO EULER- LAGRANGE EQUATIONS DURING COAST 


With the thrust off ((3 = 0), the equations of motion are 


■■ *2 M- 

r = r * - 2 

(Al) 

r 

r0 - -2 r0 

(A2) 


for coplanar orbits. The solution to these involves four arbitrary constants; 
p c , e c , u> c - the elements of the coast orbit - and (p c , the angle at which the 
coast is begun. 

The Euler-Lagrange 

equations are: 


x 4 

ZV ^ <j> 

= ~ X 1 + r 

(A3) 


2\ 6 r 

_-\ 3 -2\ 4 r 0+ r 

(A4) 

X 1 

= T [ C - X j - X 4 J ■ X 6*1 

(A5) 

s 

= 0; X. = X 7 at beginning 

of coast 

(A6) 

First, change the independent variable from t to 


1 

\ 

Z\ ,<j> 

• 0 

4 * = ~ X 1 + r 

(A7) 

1 

6 * = " X 3 “ 2r ^ X 4 + T X 6 

(A8) 
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Putting (A5) in (A7), and collecting terms, yields 


X .dr - X ,'r - X + C - ^ 

4 r 4 3 

The solution to (Al) and (A2) is given by 

2 • 

r (p = h = 


(*? - ♦) 


n/h- P. 


COS ( 0 - W ) 

c r c 


0 


W e find r by 


2 

* 1 * * 

r = 0 e sin ( 0 - oo ) 

p r c c 

c 


h e 

c 



sin (0 - co ) 
c 


From (AZ), 



0; thus (A9) becomes 


X 


X 4 r 



_c_ 


Defining true anomaly as 0 = (p - w c , and using 0 as the independent 
variable, we get upon substituting the equations of motion solution: 


x 

d0 ^ 4 


cot 0 


S p 

h e sin 0 ,2 . . n r , , 

h e sm 0 |1 + e cos 0 


C p' 


where the subscript c is now omitted. 

Substituting the orbit solution in equation (A 8) we get 


d X , X , 2 e sin 0 

6_ _6 

d 0 1 + e cos 0 



2 

h [1 + e cos 0] 


2 p X 4 (0) 
1 + e cos 0 


We note the singularity in this equation at 0 = 0, tt, and that the limit 
approaches ± oo on opposite sides of the singularity; the handling of this 
discussed below. 


(A9) 


(A10) 


( Al 1 )* 


(Al 2) 


is 
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We obtain the solution to (All) first. The homogeneous equation is 


- / 


cot 0 d 0 


\ = K sin 0 

4 1 


Applying variation of constants, we insert (A13) into (All), letting - 


K x (0). 


kJ(o) 


2.2 -.2 

sin 0 sin 0 [1 + e cos 0] 


S p 

where , C., 


- C P 
' h 2 e 


K (0) = ~C cot 0 + C - C f 

1 1 3 2 J sin ^ 0 rj 


d0 


[1 + e cos 0] 


-2 2 

Letting u = [1 + e cos 0] , dv = esc 0 d 0, we get 


J ud v = -cot 0 [1 + e cos 0] 


cos 0 d 0 
[1 + ecos0]^ 


Using ref. (10), we find 


2 e 


/■ 


cos 0 d 0 


3 ” z 

[1 + e cos 0] (1 - e ) 


sin 0 


(1 + e cos 0) 


/■ 


[-2 e + cos 0] d 
[1 + e cos 0] 2 


Multiple use of #317 and # 309 in ref. (10) yields 


/• 


d 0 


1 


2 , 2 , 

(1 + e cos 0) (1 - e ) 


•e sin 0 


Z J_ 

2 - 1 \j 1 - e tan 2 

— tan — ; 

2 1 + e 


1 + e cos 0 \] 1 - e 

where -tt < 0 < u and 0 ^ e < 1 - elliptical transfer orbits only 


(A1 3) 


(A14) 


0 
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Again, using #315 and #309 we obtain 


/ 


(1 


cos 0 d 0 
+ e cos 0) 


Collecting terms we get: 


/• 


d 0 


2 r , 2 

sin 0 [l + e cos 0J 


cot 0 [ 1 + e cos 0] 


-2 


(1 - e ) 


sin 0 


2 e 


(1 + e cos 0) (1 - e ) 


- e sin 0 
1 + e cos 0 


x 4- 


tan 


r 2 

-1 V] 


e tan 1/20 


1 + e 


sin 0 


1 - e 


2 \ 1 + e cos 0 


2 e 


4T7 2 


tan 


-i A - 2 


e tan 1/20 
1 + e 


+ C = L + C 
4 4 


K 1 (0) = - C 1 cot 0 + C 3 - 'C 2 cot 0 [1 + e cos 0] 


-2 


1 - e 


sin 0 


sin 0 


(1 + e cos 0) Z (1 - e 2 ) (1 + e cos 0) 


[2 e + 1 ] 


6 6 tan" 1 (ARG) I + C 


n V /2 

(1 - e ) 


»] + c 4 ) 


Defining the constant a , we have: 


;i 63 


v e) = 


C cos 0 + K sin 0 - C sin 0 (- cot 0 [l + e cos 0] 

X 1 £ \ 


-2 


sin 0 


sin 0 (2 e +1) 


1 - e“ |(1 + e cos 0) (1 - e ) (1 + e cos 0) 

- „ V3/2 < ARG) I) 

(1 - e ) 


(A 15) 


where 


ARG = 


. Vr 2 


e tan 0/ 2 


1 + e 


and K, is determined such that X A (0 - u> ) = X. (0 ) is satisfied. 

1 4 r c c 4 c 

Turning now to equation (A 12), we have for the homogeneous solution: 

-2 

X = K (1 + e cos 0) 
o 2 

Using the form (A14) for Kj (0) in the equation for X^ (0), substituting the 
homogeneous solution for X^ (0), above, into (A 12) and considering that 
K 2 = (0), yields the differential equation for (0): 

i - 2 1 - 2 - 1 

K (0) [ 1 + e cos 0] = - C [ 1 + e cos 0] - 2 p [ 1 + e cos 0] 

Ct X 


C d 0 

- C cos 0 + C sin 0 - C sin 0 / ? 

^ ^ sin 0 [ 1 + e cos 0] 






Thus, 


(0) = - - 2 p [1 + e cos 0] 


sin 0 


/ 


[- c , 


d 0 


cos 0 + sin 0 


sin 0 [ 1 + e cos 0] 
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Now: 


/[" C 1 " 2 P t 1 + e cos (• C i cos 0 + c 3 sin 0 )J d 0 


1 3 Z 

= 2 p (C sin 6 + C cos 6 + ~ — sin 0 cos 0 - — “ — sin 0) + C (A16) 


Finally, we need: 


2 p C 2 ^sin 0 [ 1 + e cos 0] ^ j— 


2 r 1 3 

in 0 [ 1 + e cos 0] 


u = f — — ~ ; dv = sin 0 [ 1 + e cos 0] d 0 

sin 0 [ 1 + e cos 0] e . 2 

L J v = - cos 0 + — sin 0 


f , r [ - cos 0 + e/2 sin 0] 

ludv = uv - / — 2 — d0 

^ sin 0 [ 1 + e cos 0] 


r cos 0 d 0 e f d 0 

udv - uv + / J “ T /r 77 

sin 0 [ 1 + e cos 0] 1 + e cos 0 


For the first integral let 

*%> 

y - e cos 0 


dy = - e sin 0 d 0 
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/• 


cos 8 d 6 

2 r , 2 
sin 0 [ 1 + e cos 0] 


= - e 


/ 


y d y 

, , 2 . 2 2 3/ 2 

[ 1 + y] (e - y ) 


Now, let 

z = 1 + y; 

2 2 , , , 

y =Z - 2 Z + 1 

Then: 


/- 


cos 0 d 0 


Z ,2 

sin 0 [ 1 + e cos 0J 


= - e 


/ 


(z - 1 ) d z 

2 _3/2 
z Z 


where 


Z Z 

Z = - z + 2 z + e - 1 , 


z = 1 + e cos 0 


Using reference (10), #190 and #197: 

cos 0 d 0 f d z 


/- 


Z r , 2 

sin 0 [ 1 + e cos 0J 


= - e 


/- 


z Z 


3/2 


+ e 


/' 


d z 


2 „ 3/2 
z Z 


(e - 1) 


I' 2 (‘ + * + e 2 '- l) ' 

C + ^) 


dz / 3 


I 


d z 
„3/2 


z Z 
+ C 


1/2 


(e 2 - 1) 


where 


/- 


d z 


z Z 


1/2 ~ / 


1 . -1 /z 

— T Sln \" 

1 - e ' 


+ (e - l) 1 
z e , 


and 


/■ 


d z z - 1 


z 3/2 e 2 Z 1/2 


) 

(A 18) 
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Thus, from previous results in X ( 0 ) an d collecting the above, we find: 


K (6) = (A 16 ) + 2 p C 7 uv + (A 18 ) - — 


(1 - e ) 


- e sin 0 
1 + e cos 0 




tan 


-i A- 2 


e tan 0/ 2 


1 + e 


+ C. 


(A 19 ) 


where u and v are defined below equation (A 17 ). 


Collecting terms, we get: 


(0) = [ 1 + e cos 0] 

6 

e C 


e c i 

K., + 2 p (C^ sin 0 + C ^ cos 9 + — — — 


sin 0 cos 0 


2 -sin Z 0) +2pC z |[L+C 4 ] [- cos 0 + — sin 2 0] 


(e - 1) 


+ 3 


m (' * 777 ) 


2 (1 - e ) 


e sin 0 
1 + e cos 0 




2 tan -1 (ARG) 


We note that the constants C3 and C4 appear explicitly in (A20). To 
eliminate this, we consider all terms containing them, namely: 


e C 


2 p ( C cos 0 


~ sin 2 0 j + 2 p C 7 C A (- cos 0 + — sin 0) 


2 4 


2 . 2 
= C„ (2 p cos 0 - p e sin 0) - C C (2 p cos 0 - p e sin 0) 

J L> t : 


(A 20 ) 


— 2 
= p (2 cos 0 - e sin 0), 

where Kj is the constant we determine from the initial conditions on \^(0). 
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' (9) - [ 1 + e cos 0] ^ + 2 p Cj sin 0^1 + 

/ 


e cos 0 
2 


+ p (2 cos 0 - e sin^ 0) + 2 p ^ - cos 0 + sin^ 0 ) 

+ 7Ti [ ' I^TT (‘ + V + 777) - / ^71 (‘ + 7^7) 


* 3 fz>' 2 ( 1+ a 2 -i)l + 


- e / - e sin 9 , 

2 (1 - e 2 >\‘ + ecOS 6 


1 - e 


1 - e tan 0/ 2 
1 + e 


We note that equation (All) has a singularity at 0 = 0 or it ( 0 = <*> or 
0 = oo + it). If it is necessary to evaluate across either of these points, 
we have, from the first integral (A 5), a solution. 



where the upper sign is used for 0 — w c (0 = 0), and the lower for 

0— oj + tt(0=tt). 
c 

We thus find, from (A 5): 
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(A22) 


lim 

0 — 0 
0 — it 


x 4 = 


c -\3<fr 
r 


lim 

0 — 0 



C p" 

e /x( 1 + e ) 2 


X 3 I 1 

e#t 


We can derive (A22) in a different, and more fruitful manner. Rewrite 
equation (All) as: 

dX 4 1_ f >3P Cp^ 1 

d 0 sin 0 X 4 he h 2 e p 4- e cos 0 ] 2 J 

Since we require continuity of the multipliers, the bracketed quantity must 
approach zero just as sin0 does as 6 — Solving, then, for X 4 at 6 =0, 
gives: 

Cp 2 X 3 h 

4 “ eft ( 1 + e ) 2 e ft 

Thus, we know that 

lim 

0-^0 dX 4 t 0 

0 — w d 0 0 


We can thus use L'Hospital' s Rule and derive two approximate 
differential equations for X 4 (0). In the neighborhood of 0 = 0, 


dX 4 

d 0 


= - X 4 0 


2 Cp 2 0 

ft ( 1 + e ) 3 


In the neighborhood of 0 = ir , 


dX 4 

"d 0 " 


= -X 4 (0 - ir) - 


2 Cp 2 (0 - ir) 

mo 


Solying these two equations, we obtain: 


X 4 (0) = 
0 ~ 0 


2 Cp 2 
ft(l + e ) 3 


+ K 3 exp 



X 4 ( 0 ) = 
0 ~ it 


-2 Cp 2 (0 - it) 
ft(l - e ) 3 


+ K 4 exp 


Ml-']) 
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We can similarly approximate (A 12), and obtain 



X 3P 2 4Cp 3 

h ( 1 + e)^ #t(l + e)^ 


where erf ( 0 ) is the error function, or probability integral: 

, c e 

erf (0) 


Jf 

2_ f e “ 

"VF J 


(fM l-fl) [-n£ 


du 


e / A lf 6 

^7 exp 1 — 2 “ 


X6<«) = exp 

0 ~ v 

l 1 + 2~ (»-«•>] + e'/* U P - ^) 3 ex p( A l® f’-'f']) 




where 


A i — 


- 2e 


1 " 1 - 


A 


2 " 


1 - 3e 
1 - e 


Since we do not have the switching function, k, as an explicit function 
of 0, some iterative method is needed to find the first 0 at which k crosses 
from negative to positive values. Simply using two points and a slope to find 
a parabola for extrapolation works quite well. Writing k as: 


k 



X 4 r 

D 



where 


D 

v 



2 

r 
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and 


we find 


4 d 0 


e sin 0 
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Summary 

A study has been made of minimum-fuel transfer and rendezvous 
between neighboring low-eccentricity orbits by power-limited rocket. 

This study includes and extends previous work wherein only the case of 
transfer between circular orbits was considered. As before, the 
analysis is based on the assumption that only small deviations from an 
initial orbit are allowed. Complete analytical solutions are obtained 
in three different sets of variables: (l) rotating rectangular coordinates, 

(2) rotating spherical coordinates, and ( 3 ) Lagrange's planetary 
variables. In addition to the determination of optimal transfer and 
rendezvous trajectories in three dimensions, synthesis of the optimal 
controls is also carried out in each case. The guidance coefficients 
resulting from the control synthesis are presented both in graphical form 
and in equation form suitable for use in guidance applications. 


Introduction 

It is characteristic of high-specific -impulse, low-thrust pro- 
pulsion systems that the source of power is separate from the thrust 
device itself. Consequently, such propulsion systems are referred to 
as power-limited, since thrust is restricted in magnitude by the output 
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of the power supply, which is in turn limited by the necessity of 
minimizing power supply weight. 

The problem of transfer and rendezvous between neighboring orbits 
by a power-limited rocket is of interest for two basic reasons. First 
of all, the problem can be solved analytically, as was demonstrated in 
Refs. 1 and 2, provided that the thrust acceleration is not constrained 
in magnitude and that the proper simplifying assumptions are made in the 
mathematical model of the system. The analytic expressions thus obtained 
for the controls and for the optimum trajectories then provide insight 
into more general problems where the simplifying restrictions are lifted. 
Secondly, the solution to this problem provides a lower bound to the 
performance requirements for low-thrust orbital transfer and rendezvous. 

It is interesting to note that if, for the same system model as has 
been used herein, the thrust acceleration is assumed constant, analytic 
integration of the equations of motion requires the evaluation of 
incomplete elliptic integrals of the third kind (Ref. 3 ). Therefore 
allowance for variable -thrust acceleration is essential if simple analytic 
solutions are to be obtained . 


Analytical Method 

Description of the Mathematical Model 


The phrase ,f neighboring orbits/ 1 as defined here, requires that the 
inclination between orbit planes be small and that the radial separation 
between orbits be small relative to the semi -major axis of either orbit. 
If it is further assumed that motion in the transfer orbit does not 
deviate significantly from these neighboring orbits, linearization of 
the equations of motion is permissible. 

The analysis has been carried out in three sets of variables: 

(l) rotating rectangular coordinates, (2) rotating spherical coordinates 
and, ( 3 ) Lagrange’s planetary variables. The rotating coordinates have 
been utilized previously in Refs. 4 , 5 , 6 , while the planetary 
variables were applied to an orbit transfer problem in Ref. 3 • 

The rotating coordinate systems are depicted in Figs. 1 and 2. 

Each consists of an origin which revolves at satellite velocity in the 
initial (interior) circular orbit and orthogonal coordinates measured 
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from this revolving origin. In the rectangular system of Fig. 1, y* 
is a radial dimension, x’ is measured tangent to the initial orbit at 
the origin, and z ’ is a coordinate which is out of the plane of the 
initial orbit and is normal to both x 1 and y' . 

In Fig. 2, the spherical system is composed of a radial coordinate 
y, an arc x, measured circumferentially from the origin, and another 
arc z, which is orthogonal to the x-y plane. 

The Lagrange planetary variables, which are derived from the 
elements of an elliptic orbit and are used in the standard variation 
of parameters equations of celestial mechanics (Ref. 7), are convenient 
because they eliminate the necessity of treating singularities for 
zero eccentricity and zero inclination in these equations. As they are 
used in this study, the planetary variables consist of the non- 
dimensionalized semi -major axis x x = a/a 0 , a circumferential distance 
component, X 4 , and the following combinations of the remaining 
orbital elements: 


x 2 = e sin u) 

x 3 = e cos u) ( 1 ) 

x^ = sin i sin 0 
Xg = sin i cos Cl 

where e is eccentricity, u> is the longitude of peri-apsis, i is orbital 
inclination, and Q is the longitude of the ascending node. The planetary 
variables provide a simple means of introducing eccentricity into the 
terminal orbits, and the form of the state equations using these variables 
is particularly simple in the present problem. However , in a practical 
application, they might be less desirable than the rotating coordinates 
because the orbital elements cannot be directly measured . 

In view of the foregoing considerations, eccentric terminal orbits 
have been allowed only in the planetary variables in this study, while 
the analysis in rotating reference frames is confined to circular 
terminal orbits . 

It should be noted here that the three sets of variables are 
entirely equivalent in that the equations of motion may be transformed 
directly from one set to another by substitution. There are some 
differences in the required linearizing assumptions which should be 
mentioned, however. 
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Consider the coordinate system depicted in Fig. 1, a rectangular 
system with its origin fixed on the interior orbit (assumed to be the 
reference orbit) in the x’, y' plane. The mutually orthogonal coordi- 
nates x' , y ' , z ' form a triad that revolves with angular speed no 
characteristic of the reference orbit, so that motion in this frame 
of reference is relative to a point on the reference orbit. The 
spherical coordinate system in Fig. 2 is described by the arc x in the 
plane of the reference orbit, the arc z measured normal to this plane, 
and a radial dimension y. 

In order to linearize the equations of motion in the first system, 
it is necessary to assume that excursions x', y’, z' from the origin be 
small in comparison with the radius, r 0 , of the reference orbit. Motion 
is therefore constrained to a small sphere about the origin. No 
restrictions are placed on the component velocities. In the rotating 
spherical system, only the assumption of small component velocities 
will linearize the equations, whereas the arc x is not limited. The 
resultant motion is constrained to a torus about the reference orbit. 

Since the linearized equations of motion are identical except for 
differences in notation (Ref. 4), one can draw the conclusion that, if 
in the spherical system the resultant motion does not involve large 
variations in x, the velocity components may be large. In the present 
study, use of the spherical system has been assumed throughout, and the 
results may be extended according to the foregoing discussion. 

In the case of the planetary variables, the linearizing assumptions 
require that the difference in the semi-major axes of the terminal orbits 
be small and that the eccentricity of the terminal orbits as well the 
eccentricity of the instantaneous transfer orbit be small. The 
implications of these assumptions are similar to those for the rotating 
spherical system, in that "fast" trajectories are allowed only when the 
linearizing assumptions may be relaxed. On the other hand, fast 
trajectories are allowed in the rectangular system because no limits 
are placed on the component velocities in the linearizing process . 

Analysis 


The optimization problem is to derive the optimal control equation 
for the minimum-fuel transfer or rendezvous of a power-limited rocket 
between neighboring orbits in a given time. Mathematically, this 
requires minimization of the integral 
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( 2 ) 


T T 

J = J tf (T/ni) 2 at= ]’ f (n 0 /a) A 2 dT = [ f f 0 (A)dT 

0 o fJ 0 c o 

subject to constraints imposed by the equations of state which may be 
expressed in the form 

x t = fi (x. A) i = 1, . . .,n (3) 

The control is the thrust acceleration vector. A, in the present case. 

The problem is treated as a problem of Lagrange in the calculus of 
variations. In particular, Breakwell's formulation (Ref. 8) of this 
problem is used because the linearized equations in the present case are 
particularly well suited to this formulation. 

If a fundamental function F is defined as 

f = -f 0 + r (4) 

i=l 

the variational treatment requires satisfaction of Euler -Lagrange 
equations in the following form as necessary conditions for the 
existence of an extremal arc: 


d\j 

dF 


dT~ 

dx t 

(5) 

^F 

XAj = ° 


(6) 


An additional necessary condition provided by the Pontryagin Maximum 
Principle must also be satisfied to insure that the stationary solution 
predicted by the Euler equations is actually an extremum. The maximum 
principle, which may be expressed as 

F (x t , X if A*)SF (x t , Xj, Aj ) (7) 

ensures that the stationary solution is an absolute maximum. Further- 
more, it has been shown (Ref. 9) that for a system where both the state 
variables and the controls appear linearly in the state equations, the 
maximum principle is also sufficient to ensure a minimum of the payoff, 
J. Since all cases in the present analyses are linear in the controls 
and satisfy the maximum principle, the optimum trajectories described 
herein are absolute extrema. 
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Due to the great number of equations involved, the variational 
analysis is not described in each case. Only the most important 
equations are included, and these are grouped in an orderly fashion in 
the appendices. The rotating coordinate systems are considered in 
Appendix I, and the planetary variables are considered in Appendix II. 

For a more detailed account of the application of the aforementioned 
equations the reader is referred to Ref. 1 wherein a specific case is 
treated in detail. 

Synthesis of the Optimum Controls 

In order to put the equations for the optimized controls into a 
form compatible with guidance requirements, several changes are made. 
First, t in the control equations is replaced by -t . That is, the 
equations are rewritten with "time-to-go" as the independent variable. 
Secondly, while in the ordinary transfer and rendezvous analyses in 
rotating coordinates it was generally convenient to assume zero initial 
conditions, the terminals are reversed in the control synthesis. That 
is, the target orbit is assumed to be defined by zero values in most of 
the state variables. The results of the control synthesis are expressed 
in terms of the guidance coefficients, ?>A i /ax t , of each component of the 
control vector, A. 

The equations for the control synthesis are summarized in Appendix 
III, for transfer and rendezvous in each of the coordinate systems. 

Those equations which deal specifically with transfer between circular 
orbits have been presented previously in Ref. 2. 


Results 


Orbit Transfer and Rendezvous 


The multiplicity of solutions generated in this study (particularly 
for rendezvous) precludes a graphical presentation of all the resulting 
trajectories. An attempt is made to summarize the results in a reasonably 
concise form with orbit transfer solutions represented as special cases 
of rendezvous wherever feasible. 

To simplify the presentation of the results, only circle-to-circle 
transfer and rendezvous cases are examined in the summary curves of 
Figs. 3 through 12. The first set of plots. Figs. 3 through 5, shows 
the variation of the components of the optimal thrust acceleration with 
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time for circle-to-circle transfer only. 


The in -plane components A x /y f and A y /y r are' seen to display symmetry 
about the midpoint in time for all trip times, as does the out-of r plane 
component A z /r 0 i. In particular, when T f = 2nn, the components A x /y t 
and A y /y f are constant with time, and the latter is zero. For the 
c oplanar problem, constant circumferential thrust acceleration is there- 
by specified as the optimum mode for integral multiples of the period 
of the reference orbit, a result that is in agreement with Ref. 6. 

Figures 6 through 8 show the thrust acceleration components for 
circle-to-circle rendezvous at a particular trip time equal to one 
sixth of an orbital period of the reference orbit. The parameter 
in Figs. 6 and J is x, /y f T f which takes on the value 3/4 for the 
special case of optimum transfer. Similarly the out-of-plane component 
is plotted with G f as a parameter. As indicated, the longitude of 
the node can have either of two values, 150 or 330 deg, for optimum 
transfer . 


The payoff, J, can be best represented as the sum of three 
components, , J 2 , and J3 , which are defined by Eqs . A-44 and A-45 
and are plotted in Figs. 9 through 11. The components J x and J 2 
define propellant requirements for coplanar rendezvous, while the 
addition of J 3 introduces the out-of -plane requirement. In particular 
J is equal to J x for coplanar transfer since the term x f /y f T r - 3/4 in 
J 2 is zero for optimum transfer. 

All three components, as well as their sum, are seen to be 
monotonieally decreasing functions of T f . In the limit, as T f -* =°, A 
and J - 0. This is a consequence of the fact that no limit has been 
placed on exhaust velocity. Similarly all three components tend to 
infinity as r t approaches zero because zero trip time requires infinite 
thrust acceleration. 


An interesting feature of J3 is evident from Fig. 11. For T f = kn 
where k = 0, 1, 2, ..., J 3 is the same for all nodal longitudes, G f . 

For all other times the envelope of the family of curves is given by the 
equations 

j (8) 

'■' 3 m a x “ T f - Sin T f V ' 


J 3 m i n 


+ | sin T f 


(9) 
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where the lower envelope is given by Eq. 9 and represents J 3 for 
optimum transfer. 

Application to Planetary Orbits 

Strictly speaking, none of the planetary orbits are "neighboring 
orbits" in the sense in which this term has been defined. Earth's 
closest neighbor, Venus, has a semi -major axis, a = 0.7233 AU, compared 
with a = 1.0 AU for earth, leaving a separation distance of 0.2767 AU 
which is not « 1.0 AU. However, it is possible to apply the linearized 
analysis to earth-Venus trajectories with remarkably good accuracy. 

In Fig. 12 a comparison has been made with the exact solutions of Ref. 

10 , for earth-Venus transfers . The circled points were calculated 
from Eq. A-V 3 of Appendix I. These results for the special case of 
uninclined circular terminal orbits show only a slight discrepancy in 
J for transfer times up to one earth year. 

To obtain the circled points in Fig. 12 a reference orbit mid-way 
between the two terminal orbits was selected, i.e., a = 0.8617 AU. This 
improves the accuracy of the results over what could be obtained by 
referencing the coordinates to the major axis of either terminal orbit. 

These results are encouraging and tend to support the view that 
an extension of the linearized analysis may be adequate for transfer 
and rendezvous between the orbits of earth and the nearby planets. 

Such an extension need not even be an exact second-order solution but 
might include only the dominant second-order terms in the equations of 
motion. This possibility is currently being explored by inclusion of 
the second-order terms in the radial motion. 

Control Synthesis 


In this study it has been possible to express each of the components 
of the optimal control vector. A, as a linear function of the n state 
variables . 


a, =8 ?h x 

i=l dx t 


1 


( 10 ) 


Therefore the presentation of the results can be confined to curves of 
the guidance coefficients, SA J /dx 1 plotted against time to go, t* . 

Using the equations for the guidance coefficients which comprise 
Appendix III, the summary curves of Figs. 13 through 2b were generated. 
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The synthesized controls for the case of transfer between an 
arbitrary state and a nearby circular orbit appear in Figs. 13 through 
15 in terms of the rotating coordinate system variables. The extension 
to include eccentricity of the final orbit is provided by use of the 
Lagrange planetary variables in Figs . 1 6 through 18 . 

For rendezvous the same procedure is followed in the presentation 
of the synthesized controls, with the addition of curves to account for 
the dependence of in-plane thrust acceleration components on the 
circumferential distance. In rotating coordinates, Figs. 19 through 21 
s umma rize the results for rendezvous between any initial state and a 
point on a nearby circular orbit. 

As in the transfer case, the planetary variables facilitate the 
extension to rendezvous between an initial state and a point on a nearby 
orbit of low eccentricity. The results for the planetary variables 
appear in Figs. 22 through 2b. 

All the curves for the guidance coefficients display similar behavior. 
When time-to-go is short, the curves diverge to infinity, (either positive 
or negative), but a damped oscillation is evident, causing the coefficients 
to approach zero for very long times. 
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Nomenclature 


A 

C 

f 

F 

J 

D 

B 

Q 

$ 

X 

r 

R 

W 

S 

n 

x’fY V 

u,v,v 

t 


Thrust -to-mass ratio 

1_ T 
^ m 

Integration constant 

Rate of change of a state variable 

Fundamental function 

Defined by Eq. 2 

Defined by Eq. A-146 

Defined by Eq. A- 17 ^ 

Defined by Eq. A-lf3 
Defined by Eq A-I 38 
Lagrange multiplier 
Radius 

Radial force 
Normal force 
Circ um ferential force 
Mean angular motion 

Position components in spherical system 
Position components in rectangular system 
Velocity components in x, y, z, directions 
Time 
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Nomenclature ( C ontd . 


T 
T ' 

T\ 

u) 

e 

N 

a 

Q 

i 

*i 

X 3 

% 

Xe 

— ♦ 
c 

Subscripts 


not 

Time to go 
True anomaly 
Longitude of peri-apsis 
Eccentricity 

Unit vector normal to instantaneous transfer orbit 

Semi -major axis 

Longitude of the node 

Inclination 

a/ao 

e sin uu 

e cos tu 

sin i sin Q 

sin i cos Q 

Angular momentum vector 


i Index denoting x,y,z,u ; v,¥ 

j Index denoting x,y,z 

o Initial condition 

f Final condition 

x,y,z,a,v } v Denoting state variable 
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Nomenclature (Contd. 


R Radial 

S Circumferential 

W Normal 

Superscripts 

* Optimum condition 

-» Denotes a vector 
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Appendix I 


Rotating Rectangular and Spherical Coordinate Systems 
1. Equations of State 


dr 

u 

(A-l) 

dy _ 

dr 

V 

(A-2) 

dz _ 
dr 

w 

(A-3) 

u 

zj| 

■op 

Ax + 2y 

(A-4) 

dv 

dr “ 

A y + 3y - 2u 

(A-5 ) 

dw _ 
dr 

A z - z 

(A -6) 


2. Euler- Lagrange Equations 


Xx 

= 0 

(A-7) 

x y 

= -3X v 

(A-8) 

X z 

= x w 

(A-9) 

Xu 

= — X x + 2X V 

(A-10) 

x v 

li 

1 

*c 

1 

no 

c 

(A-ll) 

x w 

= -Xz 

(A-12 ) 

Xu 

= Po Ax 

(A-13) 

Xv 

- 00 Ay 

(A-14) 


= n <A 

(A-15) 
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3 . Integrated Euler -Lagrange Equations 


X* 

n 0 Cq 

(A - 16 ) 

Xy 

= -6n 0 (C 4 + C 0 r -C|COSr + C 2 sinr ) 

‘ (a-it) 

Xz 

= 2n 0 ( C 5 sinr + C 3 cost ) 

(A- 18 ) 

X u 

= n 0 (3C 4 + 3C 0 r - 4C, cost + 4C 2 sinr ) 

(A-19) 

Xv 

= 2n 0 ( C 0 + C, sinr + C 2 cosr) 

(A-20) 

\ w 

= 2 n 0 (C 5 COsr - C 3 sinr ) 

(A-21) 


4. Boundary Conditions 

Trans fer 


Rendezvous 


State Variable 


x 

y 

z 

u 

V 

w 


r =0 
0 

0 

0 

0 

0 

0 


• r = r f 


FREE 


Vf 



0 



r =0 
0 
0 
0 
0 
0 
0 


r = r f 


Vf 


3 H> 
^ y f 



5 . Integrated Equations of State -(with initial conditions) 

x = [ I6(r - sinr ) - -|r 3 ]c o + [l6(l-cosr ) - lOr sinrjc, 

+ [22 sinr - lOrcosr - 1 2r ]c 2 - [ -|- r 2 - 12 ( I - cost )] C 4 


(A-22) 


y = [ 8(1 -cost) - 3r 2 ]c 0 + 5[sinr - rcosr ]c, + [ 5r sinr - 8(l-cosr)jc 2 

, c r . Ip (-A-23) 

+ 6 I sinr -r J C 4 


= [t cost - sinrjc 3 +• [rsinrjc E 


(A-24) 

( 1 ) REF 6 

(2) REF 5 
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(A-25) 


u = [ I6( l-cosr) - r 2 ]c 0 4- [6 sinr - lOrcosr] C, 

4- [jOr sinr — I2( I - cost ) j C 2 4- 12 sinr — 9rj C 4 

v = [ 8 sinr- 6r] C 0 4- [ 5r sinr j C, 4- [ 5 t cost - 3sinr] 

4-3^1 — cost j C 4 

w = [-TsinTjc 3 4- [ sinr 4- tcost]c 5 


6. Transversality Conditions - Transfer 


X, = C c = 0 


5 

C 3 


Wf 

tonr f + — 
T z f 


w f 

1 ~ ^7 tQnr f 


Constants of Integration 
Transfer 


y f sin r f 


C 2 = 


16 ( I - cos Tf ) - T f (5r f 4- 3sinr f ) 
- y f (I - cosr f ) 

!6(l-cosr f ) -T f (5r f 4- 3sinr f ) 


(sinr f 4- r f cosr f ) z f - (r f sinr f ) i 2 - z 2 


r f - sin T f 


yt 


C* = 


-g 5 - ( 5r f 4 3sinr f ) 


16 ( I - cosT f ) -T f (5r f 4- 3sinr f ) 


(A-26) 

(A-2T) 

(A-28) 

(A-29) 

(A-30) 

(A-31) 

(a-32) 

(a-33) 
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Rendezvous 


C 0 = 


f Vf(^ " x) (5T f~ 3 sinr f ) 


r f (5r f — 3sinr f )(r f 2 - 80) 4 4(1- cosr f ){ 7lr f 2 - 64 ) 4- 248r f 2 cosr f 


C, = 


c 2 


y f sinr f 


16(1- cosr f ) - r f ( 5r f + 3sinr f ) 


- y f ( I - cos r f ) 


16(1 - cos Tf ) - r f ( 5r f 4- 3 sinr f ) 


4- C 0 


+ C n 


3sinr f — 8( l-cosr f ) 
5r f - 3sinr f 

3r f ( I 4 cosr f ) - 8 sinr f 
5r f — 3sinr f 


C 3 = 
C 4 = 


/ 2 

( sinr f 4- r f cosr f ) z f - (r f sinr f ) vr 0 
(r f - sin r f ) 

-|r (5r f 4- 3sinr f ) 

I6( I - cosr f ) - r f (5r f 4- 3sinr f ) 


2 2 
“Zf 



/ 2 2 2 

(r f sinr f )z f 4- (r f cos r f - sinr f ) V r 0 i - z f 



( r f 2 

- sin 2 r f ) 

C ontrols 




A x = 

3C 4 4 3C 0 r - 4C| cosr -1- 4C 2 sinr 


Ay = 

2 [c 0 + C, sinr 4- C 2 cost j 


A z = 

2 j^C 5 cos r - C 3 sinrj 


9 • Payoff 
Transfer 

j (- 77 ) (5r f 4-3sinr f ) i 2 

n 0 3 r 0 2 8 ^r f ( 5r f 4 3sinr f ) - 16 ( I - cos r f ) j r f 4|sinr f | 


(a-3*0 

(A-35) 
(A-3 6 ) 

(A-37) 

(A- 38 ) 

(A-39) 

(A-4o) 

(A-41) 

(A-42) 

(A-43) 
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Rendezvous 


3 2 


3 2 

n o r o 


+ J > >2 < A - W > 

f(5r f + 3sinr f ) 

8 [ r f ( 5r f + 3sinr f ) - I6( I - cosr f )j 


~ 2 f r f (5r f - 3sinr f )(r f 2 - 80 ) -I- 4( 


[ 


r f - sinr f cos ( 2.Q, f + r f ) 
(r f 2 — sin 2 r f ) 


] 


cosr f )(7|r f Z -64) + 248r f 2 cos'r f 


(A-45) 


10. It should be pointed out that for each free end condition in the 
case of orbit transfer, the variational analysis predicts an optimum 
value for that particular state variable at the end point. In the 
rotating coordinate systems the x and z coordinates are left open at 
final time, r f . The end point for the optimal transfer is then 
determined in the analysis and is defined by the equations. 



(A-46) 

(A-vr) 
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Appendix II 


Lagrange's Variables 

In the theory of special perturbations, as derived in Ref. 7 for 
example, the equations for rates of change of the elements of an elliptic 
orbit are written in terms of the elements and acceleration components 
S, R, and W, which are perpendicular to the radius vector, radial and 
normal to the orbital plane, respectively. 

Consider the five elements a, e, i ,<u . The equations for 

small rates of change of these variables are 


da 

2 

e R sin rj + S( 1 + e cos 17 ) j 

(A- 48 ) 

dt 

nV'l - e* 

de 

V\ - e 2 

\ R , inr) , 2c057 ? + e + ecos ^ e 1 

(A- 49 ) 

dt 

na 

R sin 17 -t- 0 

L ' 1 -1- e cos 17 J 

di 

dt 

yi - e 2 

na 

W COS (a > + rj) 

(A- 50 ) 

dou 

dt 

</l - e 2 r 
nae L 

2 + e cos 17 e ,an " 5 " sinM-77) 

Roost? + S sinor W 

' 1 + e cos rj ' 1 + e cos 77 

1 (A- 51 ) 

d £1 . 

dt 

y-e 2 

na 

w 

— — sinloj+'n) 
sin 1 1 

(A-52) 


In order to avoid singularities for zero eccentricity and 
inclination in Eqs . A -51 and A-52 these equations may be transformed 

according to the following definitions: 


*2 = 

e sin gj 

(A- 53 ) 

x 3 = 

e cos co 

(A-54) 

*5 = 

sin i sinft 

(A- 55 ) 

x 6 = 

sin i cos £1 

(A-56) 
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Under the assumptions 


e << I 
a ~ a 0 
n ~ n 0 

r = n Q t = oj + t) 
i << I 




A s = 


w 




and with the further definitions 


(A-57) 


(A-58) 


a 


x 4 = x 


(A-59) 

(A- 6 o) 


the equations of state for the variational problem may be derived from 
Eqs. A-48 through A -56 . 


There is a direct equivalence between these equations and the 
equations of state in the rotating coordinate system variables. That 
is, each of the Lagrange variables x x , x 2 , x 3 , . . . x^ , can be expressed 
in terms of the rotating coordinate variables, x, y, z, u, v, and w. 


Referring to Fig. 25 , define a position vector r in nonrotating 
coordinates originating at the center of attraction F. Assume the motion 
out of the reference plane is uncoupled from the in-plane motion. 


Relative to a rotating rectangular coordinate system originating at 
0 and rotating with angular velocity 7T this vector is 


r = X i + ( r 0 + y ) j 


(A- 61 ) 


where the unit vectors i and j are taken in the x and y directions, 
respectively. The vector velocity V is obtained by differentiating r . 

^ d r . m m * r 

V = ■ . . = ui + vj 4- nxr 

dl (A-62) 
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(a-63) 


Since n - n 0 k y the expression for V is 


V 


= [ u — n 0 ( r 0 + v.]T + ( v •+■ n 0 x ) j 


Using Eqs. A-6l and A-63 , expressions can be written for the 
angular momentum cf , the path speed V and the radius r of the vehicle 


C = r x V 


= [x(v + n 0 x)- ( r 0 + y )( u - n 0 ( r 0 + y ))] k (A- 64 ) 


v = v v • v 


u - n 0 ( r 0 + y ) -I- [v + n 0 x 


(A- 65 ) 


r = J r • r 


x 2 + ( r 0 + y ) 2 


(A -66) 


The following equations can be written for the angular momentum, speed 
and radius of a body in an inverse square field . 


I C I = /Ka( I - e 2 ) 

v = T ~ a" ) = v^(r ) 2 + (r^) 2 

r _ o(l - e 2 ) 

I + e cos 77 

Combining these equations with the absolute value of C , and with 
V and r from Eqs. A -64 , A-65 and A-66 the following scalar 

equations result. 


(A-67) 

(A-68) 

(A- 69 ) 


— - ( l + y ) ( I + e cos i? ) 

a o 'o 



(A-70) 

(A-71) 

(A-72) 
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Finally, noting that 


— = x, , x, = e sinu> . x, = e cosoj 

a 0 i ’ 2 3 

e cos Tj = e cos(t-oj) = x 2 sinr -1 x 3 cosr 
the equations relating the coordinates are obtained. 

= ( x, - I ) - x 2 sinr - x 3 cost 

-r—r = x, cost - x p sinr 

'* 0*0 3 c 

■zr— = -i- ( x, - I ) - 2x, sinr — 2 x,cost 

n o r o 2 * 3 

The components of the out-of- plane motion can he related in the 
following way. If N is a unit vector normal to the instantaneous 
transfer orbit and s is a unit vector in the direction of the line of 
nodes, then 

r = "n x~K (A-77) 

and, since the angle between s and the vehicle is r ~ £1 

cos (t -£l) = s • i (A- 78 ) 

Also, the orbital inclination is 

cos i = N • K (A-79) 

Using these parameters the equation for the elevation, z, of the probe 
is 


(A-73) 

(A-7M 

(A-75) 

(A-76) 


T~ = tan i sin(r-XZ) — sin i sin(r-il) 

1 O 


(A-80) 


or 


z 

— = — Xc cos t + x fi sinr 

r o 


(A-81) 
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The out-of-plane velocity, w, is 


TP- = x s sinr + x fi cost {A-82) 

n 0 r 0 D ° 

1. Equations of State 


U A | 

dr 

= 2A S 


dx ? 


(A-83) 

dr 

= 2AgSinr — A r cosr 


dx 3 

dr 

= 2A g cosr 4- A r sinr 

(A-84) 

(A- 85 ) 

dx A 

3 

(A-86) 

dr 

- -g- (x,-l ) - 2x 2 sinr — 2x 3 cosr 

dx 5 

= — A w sinr 

(A-8t) 

dr 

dx 6 

dr 

= A w cosr 

(A -88) 


2 . Euler-Lagrange Equations 


K 2 

- _ A x 

2 A 4 

= 2X 4 sinr 

(A-89) 

(A-90) 

K 3 

= 2X 4 cosr 

(A-91) 

X 4 

= K 6 = K 6 = 0 

(A-92) 



(A-93) 

n 0 A s = 2 ( X| + X 2 sinr 4- X 3 cosr ) 

n 0 A R = - X 2 cosr 4- X 3 sinr 

(A-94) 

n 0 A w = X 3 

sin r 4- X 6 cos r 

(A-95) 
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3 . Integrated Euler-Lagrange Equations 


\ = \o 2 ^4 r 

(A- 96 ) 

Xg = X20 2X4 COST 

(A- 9 T) 

^ 3 = ^30 +2X4 sinr 

(A-98) 

X 4 = CONSTANT 

(A - 99 ) 

\ " 
*5 - 

(A-100) 

\ " 
K e _ 

(A-lOl) 


k . Boundary Conditions 


A great simplification in the complexity of the equations can be 
achieved by taking advantage of the symmetry afforded by the Lagrange 
variables x^ and x 3 . Therefore, in performing the integrations it will 


be convenient to 

use limits— 

r f /2 to r f / 2 

for the "in -plane" 

state 

variables . 

Transfer 

Rendezvous 



State Variable 
("in -plane") 

-1 

m 

l 


^C\J 

I 

II 

r =IL 
2 

x i 

1 

O 

-4» 

+ 

1 

Ax i f + 1 

*2 

x 20 

x 20+ Ax gf 

x 20 

x 20 + Ax 2 f 

*3 

x 30 

x 30 + A X 3f 

x 30 

X 30 + Ax 3 f 

*4 

x 40 

FREE 

x 40 

X 40 '*‘ Ax 4 f 

( out-of -plane) 

T = 0 

T=r f 

r =0 

r = r f 

*5 

0 

X 5f 

0 

X 5f 

*6 

0 


0 

X 6f 



5. Integrated Equations of State (with initial conditions) 

2 

Ax, : 4X |0 ( r + y ) - 4X 20 (cosr- cos ^- ) + 4X^1 sinr + sin - 3X 4 ( r 2 - ) 

Ax 2 =- 4X I0 ( cosr- cos ) + 4p[ 5(r + ) - 3(sinr cosr + )] A-102) 




3 qv sin r — sm 


- 2 -g- ) - 2X 4 [ 4 ( sinr 4- sin ) - 3 ( r cos r + -jj- cos ) 

.. 3 p p r. ( A-103J 

Ax 3 = 4X |0 ( sinr + sin X 20 (sin r — sin ^- ) 

4- 5 (r+ -£-) t 3(sinr cost 4 )] 


- 2X 4 [ 4(cosr - C( ? f ) 4- 3( r cost 4- cos -^-)] 


Ax 4 = X l0 { 3 (r + ^ ) 2 -8 [ I - cos (r + ^ )] } 


(A-104) 


+X 20 ((r 4- ^ ) [ 5 cosr + 6 cos ^-] - sin(r 4^-) — sinr - 8 sin j 

+ ^ 3 o{( r + ~2 ) [ 6 sin^|- - 5sin r j -I- cos(r 4^-)— cost + 8 cos } 

+ X 4 ( 16 (r + -^) - 6 r f [ I - cos (r 4- )] 4 3 (y ) -4 -|-t -(^-) 2 1" r3 } 

4- 2 x 20 [ cos r - cos - 75 - ] - 2 x 30 [ sin r 4 sin ] 

(A- 106 ) 


X K Xg p 

x 5 = — ( t - sin r cost ) - — sin r 

Xp n 

x fi = =- sin r + - 5 - (T 4- sinr cost ) 


xA-105) 


(A- 107) 


6 . Transversality Conditions - Transfer 


7. Constants of Integration 


Trans fer 


Ax 


X 4 = 0 


X, 

— 2 = tanr 

^6 


MO 


^-4 (5r f + 3sin r f ) — 4&x 3f sin -4 
r f ( 5 r f + 3 sin r f ) - 16(1- Cos r f ) 


(A-108) 

(A-109) 


(A-Iio) 
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(A-lll) 


^•20 = 


2 

5 r* - 3sinr f 


^30 = 


2 [ T f Ax 3f“ 2Ax lf Sin ^] 

r f ( 5r f -I- 3sinr f ) - 16(1- cosr f ) 

K 5f ( r f 4- sinr f cost) + x 6f sin 2 r f 
2(r f 2 - sin 2 r f ) 


Rendezvous 


‘10 


(5r f + 3sinr f ) -4Ax 3f sin-^- 
r f ( 5r f + 3sinr f ) - 16(1 — cos r f ) 


^20 ' 


r f ( 5r f - 3sinr f ) ( j|r f 2 + I) - 2 ( 8 sin y-3r f cos 4p 2 


3 . . t, 

■4" r f AX| f ( 3r f cos ■ 


+ Ax 2f [-^- T- f 3 +8r f — 3r f ( i - cosr f ) - 8sinr f j 
- [3T f cos-jj- - 8sin-^-] [2Ax 3f sin-^ 4- Ax 4f + 4x3Qsin^- ] 


30 


2 [r f Ax 3f - 2 Ax |f sin j 
r f ( 5r f + 3sinr f ) -l6(l~cosr f ) 


r f (5r f - 3sinr f ) (-j^rr f 2 + 1 ) - 2(8sin j - 3n f cos > 2 

Axgf r r f t* rf -] 

— [llTfCOS-^- + 3 sin-?*- ( I - cosr f ) - 22sin -j- I 

r Ax, f r f Ax 4f Tf 1 

+ ( 5r f - 3sinr f ) [ -p-sin - -I- x^sin -p J 


l| T fAx |f (5r f - 


(A-112) 

(A-113) 

(A-114) 
8 sin ) 

(A-115) 
(A-116) 
- 3sinr f ) 

(A-11T) 
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\ 5 = 


\; = 


{ X 5f (T f 


. 2 

4- sinr f cosr f ) + x gf sin r, 


f } 2 i [ T f sir»XZ f + sinr f sin (£l f +r f ) j 


2 . 2 
r f - sin r f 


2 . 2 
r f - sin r f 


2 | x 6f sin 2 r f + x 6f (r f -sinr f cosr f ) } 


(A-118) 

2 i [ r f cos^i f - sinr f cos(%4r f ) j 


2 • 2 
r f - sin r f 


2 _• 2 


r f - sin T f (A- 119 ) 


8. Controls 


n 0 A s = 2 \ | 0 - 3X 4 r + 2X 20 sinr +■ 2X30 cost 
n 0 A r = 2 X 4 - X 20 cos r 4- Xjq sin r 

n 0 A w = ~ X 5 sinr + X g cosr 


(A-120) 

(A-121) 

(A-122) 


9. Payoff 
Transfer 


_ 3 , 2 

n 0 r 0 


Ax. 


(5tj + 3sinr f ) — 4 A x |f Ax Jf sin-^- + ,r f Ax Jf 2 


r f ( 5r f 4- 3sinr f ) - 16 ( I — cosr f ) 


2 A x 


.2 


2f 


5r f - 3sinr f 


T f 4 I sin r f | 


(A-123) 
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Rendezvous 


A x 


8 


2 T 

— ( 5 r f + 3 sinr f ) - 4 Ax |f Ax 3 f sin-^ + r f Ax ^ 2 


n 0 V 


8 


r f ( 5 r f + 3 sinr f )- I 6 ( I - cosr f ) 

( 5 r f - 3 sinr f ) ( 2 Ax 2f cosy - 2 Ax 3 f sin-^ - Ax 4f + r f Ax (f - } 


r f ( 5r f - 3sinr f ) ( ^|- r f 2 + | ) - 2(3r f cos -y - 8 sin^ f 
Ax^ ( 3r f cos - 8 sin ^ ) { 2Ax 2f cos ^ - 2 Ax 3f sin ~Ax 4f + -|~r f Ax |f -4x^0 ^ } 


16 

T f Ax 2 f 2 ( l| r f 2+ 1 ] 


r f ( 5 r f - 3 sinr f )( -^r r f 2 4 - I ) - 2 ( 3 r f cos-^- - 8 sin ) 


r f ( 5r f - 3 sinr f ) { r f 2 + I ) — 2 ( 3r f cos - 8 sin ^ ) 

2 (" — sin r f cos ( 2ii f + r f ) 

' [ (T f 2 - sin 2 r f ) 


(A-124) 


10. The optimal values for changes in the state variables x* and £5, 
are predicted by the variational analysis in the case of orbit transfer 
where the values x* and are left open at the final time. 


* 3 T f T f 

Ax 4 = 4 "T f Ax lf - 2 AX 3 , sin-g- - 4 xjoSin ^ 


4 Ax 2f 


5 r f - 3 sinr f 


( 4 -cos-x( 5 r.- 3 sinT f ) 4 - 3 T.cos^- 8 sin-^-} 
12 2 f f f 2 2 -> 


ii f = n7r — 


(A-125) 

(A-126) 
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Appendix III 


Synthesis of the Optimal Controls 
A. Rotating Coordinates 
1. Control Equations 


Ay 

Ax 

A z 


a \ 


a a 

y .. , 

dA 

y u 1 

aA y 

a y y 

1 

du 

a v v 1 

ax 

aA x 

+ 

d A X , 

dA « , + 

d \ 

a y y 

a u u + 

a v v + 

(3x 

dA t 

— 7 

-1- 

dA z 

— \u 




dz 7 + dw 


2 . 

dAy 


dy 


Guidance 


12 r* 


$ 


Coefficients -Transfer 


I - cost' )( 29 - 27cosr') 


dA y 

du 

dAy 

dv 


<m x 

a y 


a A x 

au 


24 


— cost') ( II sin r‘ - 3r‘ cosr' - 8r' ) 


12 

12 | 

4> 1 


6 

$ 


5r' 2 + 3r' sinr' cost' - 8 sin 2 r' ) 


70'r‘sinr' — 55r' 2 I8r' sinr 1 cosr'+- 3 ( I — cosr>)( 


65r' 80r' sinr' — 24 r' sinr' cost'— ( | — cosr')( 25 


(A-127) 

(A-128) 

(A-129) 

(A-130) 

(A-131) 

(A-132) 

- 27cosr')] 
(A-133) 

103 cosr* )j 
(A-134) 
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I 


where 


<?A X 


24 

$ 


( 8r' - II sinr 1 4- 3 r 1 cost 1 )( I — cost* ) 


(i) 


(A-135) 


aA z 

dz 

dA z 
a w 


— 2 sin 2 r' 
r' 2 - sin 2 r' 


— ( 2 r'- sin2r') 
r' 2 - sin 2 r' 


(A- 136) 
(A-137) 


= 480r' — 75 r' 3 — 240r'cosr' ( I + cost') — 144 sinr'( I - cost') — 21 3r' sin 2 r' 

(A-138) 


3 . Rendezvous 


Due to the length and complexity of the synthesized, in-plane, 
control equaticns for rendezvous, the guidance coefficients are not 
written explicitly here. Instead the basic equations are tabulated, 
and the coefficients calculated from these equations are plotted in 
Fig. 19 through 21. 


i 

I 

I 

t 

i •* 

i 

i 





_aA, 

dx; 


ac 4 ac 0 ac 

— — — 3 — r‘ - 4 

dx ; dx. 


dC, 


I p 

"r — cost' — 4 -r — sinr' 
dx; dxj 


(a-139) 


a A y 
dx,- 


ac rt ac, 

axi 


,(Ko 

\ axj 


ac 2 

sinr' H — cosr' 


dx 


(A-l4o) 


^A z 

dz 


z + 


aA, 


w 




(A-l4l) 


(,) NOTE : 


dA x dA y 

dv d u 
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X 

*., 

d> 

M2 

d> 

'14 


d> 

MO 

X 

*« 

*w 

y 

*2. 

*22 

*24 

p _ 

*20 

y 

£22 

*24 

u 

*3, 

^32 

^34 

c i - 

*30 

u 

*32 

‘*34 

V 

*4, 

* 42 

*44 


*40 

V 

*42 

*44 


D (A-lte) D (A- 143 ) 


*10 

*,. 

X 

*14 


*,0 

*11 

*12 

X 

*20 

*21 

y 

*24 

c - 

*20 

*21 

*22 

y 

* 

^30 

*3, 

U 

<f> 

'34 

^4 

d> 

^30 

<£ 

r 3l 

<4 

'32 

u 

*40 

*41 

V 

*44 


*40 

*41 

*42 

V 


D (A- 144 ) D (A- 145 ) 

where 


* 

MO 

* 

Ml 

* 

M2 

M4 

*20 

*21 

*22 

*24 

*30 

*3, 

*32 

*34 

*io 

*4, 

*» 2 

*44 


(A-i 46 ) 


and 

= -f '■' 3 - 8 t' + 8 S i„r' 

<£„ - 8 ( I - COST') - 5 -r'sinr' 

4> [Z = 5 r'cosr' - II sinr 1 4 - 6 r' 

9 2 

<£ i 4 = 6(1- cost') - -4 r 


£30 = 8(1 - cosr,) “ T T ' 2 

<£ 3| = 5 r‘ cost' - 3 sinr' 


^> 32 = 5 r'sinr' - 6(1 -cost') 



- 6 sinr' 
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^20 

*2, = 

<k>2 = 

^24 = 


4 ( 1 - COST-') — T Z 

[ T'cosr' - sinr'j 

r<sin t* - 4 ( 1 - cost' ) 

3 ( r< - sinr«) 


3 t' - 4 sinr* 
5 

-g- r’sin t* 


^40 = 

*4, = 

<k>2 = 

d> = — 3 ( I - COST') 
r 44 


-x R 

sinr' T'cosr' 


(A-147) 


8a 7 


- 2 sin 2 r 


dz t 2 — sin 2 r' 

dA z - ( 2 t' - sin 2 t‘) 


d w 


t' 2 — sin 2 r‘ 


(a-i48) 


(a-i49) 


B . Lagrange Variables 

1. Control Equations 



dA R 

aA R 


aA R 

a r = 

dA*, Ax ' + 

dA, 2 A ‘* + 

dA* 3 AXs + 

aAx 4 


a a. 

aA q 

aAc 

aAc 

A S = 

Ax + 
oAx | 1 

dA* 2 Ai, 2 + 

■®r s Ax » + 

O 

^ A X 4 

A w ' 

dA w 

Ax 9 4 - 
aAx 5 9 

*<6 
aAx 6 6 




<?A r 

^ x 4 + “aT" x 30 

OX 30 


(A-150) 


(3a s 

Ax * + “a^ 30 (A- 151 ) 


2. Guidance Coefficients-Transfer 


(A-152) 


dA 


R 


T 1 

— 4 sinr' sin -g- 


dAx. t'( 5t* + 3sinr') - I6( I - cost') 


(A-153) 
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2 COST 


(A-154) 


aAx 2 5 r' - 3 sinr 1 

dA R 2 r' sinr' 

3 Ax 3 tX 5 t* + 3 sinr') - I 6 ( I - cost') 

<3A $ 8 cosr'sin - -£-( 5 r' + 3 Sinr') 

“aAx^ = r'( 5 r‘ + 3 sinr') - 16(1 - cost') 


(A-155) 


(A-156) 


dA s 4sinr' 

dAx 2 5r'-3sinr' 

5A w cosr'sin 2 r' 

0 Ax 5 T | 2 -sin 2 r' 

dA w - -jr cosr'( 2r‘- sin2r‘) 

dAx 6 r' 2 - sin 2 r' 

dA s 4(2 sin y - r'cosr' ) 

dAx 3 i- , (5r , + 3sinr') — 16(1 - cos r') 


(A-15T) 


(A-158) 


(A-159) 


(A-i 6 o) 


3 . Guidance Coefficients -Rendezvous 


, *3 r t 1 T* 1 

dA R 4 sinr'sin -g r[ 5r' - 3 sinr' + 2 cosr'( 3r'cos - 8 sin y )J 

5Ax, Q B 


(A-l 6 l) 


ua r 

aAxg 


212 


dA R 

- 2r<sinr 

' sin y [sr 1 - 3sinr' + 2cosr'( 3r* cos y -8siny) 

- “ f" ' ' 

L 

’ dAx 3 

Q 

B 

(A-163) 

<3A r 

y [ 5r‘ - 3sinr' + 2cost'( 3r'cosy - 8 siny )] 

(A- 164 ) 

> 

X 

A 

1 


B 

dA R 

2 sin y 

[ 5t‘— 3sinr' + 2cosr'( 3r'cos y - 8 siny )] 

(A-165) 

^*30 


B 


dA s 

y (5r‘ 

J- * 

+ 3sinr'- 16 siny cosr') 


(3Ax| 


Q 

(A-166) 


ll"' 

3r'( 5t' — 3 sinr') + 8 sinr'( 3r'cosy -8sin-y)J 




B 


dA s 

4r'sinrt 

-jy r' 2 + 1 ) + y t'cos y ( 5r- - 3sinr') 

(A-167) 

dAx 2 

r' 8 r- r 

( 3r'cos y - 8 sin y )( 3r'+ 4sinr'cos y ) 


1 

B 


dA s 

4( r'cosr' 

- 2 sin y ) 

(A-168) 

dA«,' 

y sin y[ 3r' ( 5r'— 3sinr') + Ssin^ 3r'cos y - 8siny )] 



B 

• 1 1 T 

(A-169) 


1 

4 

3t‘(5t'- 3sinr') 4- 8sinr‘( 3r'cos y - 8siny )J 

^ A 


B 


dA s 

r 1 

sin 

3r'( 5r‘- 3 sinr 1 ) + 8sin r'( 3r'cos y - 8 sin y )] 

(A-170) 

^*30 


B 


aA w 

2 sinr'( r* + sin 2r‘) 

(A-171) 

aAx 5 

= T*- 

. 2 

sin r' 

dA w 

2 [ sinr 1 + cosr‘( r 1 — sin 2r')j 

(A-172) 

<3Ax 6 


r ,z — sin 2 r‘ 
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where 


Q = 16(1 -cost') - r'( 5 t' + 3sinr') 

B = r'( 5 t' - 3sinr')( ^ r' 2 + I ) - 2( 8 sin ^ - 3r'cos-|- 


(A-1T3) 
2 (A-174) 
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FIG. 8 


NORMAL ACCELERATION 

CIRCLE - TO - CIRCLE RENDEZVOUS 

r f = 60 ° 

OPTIMUM TRANSFERS : £i f = 150°, 330° 
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GUIDANCE COEFFICIENTS FOR OPTIMUM CONTROL 

ORBIT TRANSFER 


FIG. 14 
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FIG. 15 
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DEFINITION OF SYMBOLS 


U 

R 

r 

V 

AV 

AV 

k 

T 

m 

9 

m 

c 

X, X, <r 

X 


X 

X 


£ 


X 


r> 


t 


t 


1 


Gravitational constant 
Vehicle position vector 
| R | = magnitude of R 
Velocity vector of vehicle 
Impulse velocity vector 
| A V | = magnitude of A V 
Magnitude of thrust 
Unit vector in direction of thrust 
Mass of vehicle 
Mass flow 

Constant, proportional to specific impulse 
Lagrange multipliers or adjoint variables 

|X| = magnitude of X 

• • 

| X 1 = magnitude of X 

Component of A. parallel to R 
Component of X_ perpendicular to R 
Time 

Time at end of first thrust period 
Time at beginning of second thrust period 
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SUBSCRIPTS 


o Initial value 
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REPUBLIC AVIATION CORPORATION 
Farmingdale, L.I. , New York 


APPROXIMATE INITIAL VALUES 
OF LAGRANGE MULTIPLIERS FOR THE 
TWO POINT BOUNDARY VALUE PROBLEM 


By 

Jack Richman 


SUMMARY 


This report describes a method for obtaining a first estimate of initial 
values of the Lagrange multipliers for the "Two Point Boundary Value Problem 
of the Calculus of Variations". 

This first estimate is obtained by assuming the "Two Impulse Orbit 
Transfer Problem" to be a reasonably close approximation to the Calculus 
of Variations problem. 
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INTRODUCTION 


The method used to solve the two point boundary value problem of the 
calculus of variations is one where the decision functions are such that all the 
trajectories being used are extremals [l]. In addition to the state variables, 
that appear in die equations of motion, there are a number of adjoint variables 
or Lagrange multipliers that satisfy additional equations for the optimization of 
the given system. The boundary conditions for the adjoint variables define the 
natural end-point conditions of the state variables. This natural end point, in 
general will not be the desired end point. A differential correction scheme 
provide the means of obtaining another optimum trajectory, the natural end 
point of which will be closer to the desired end point [2] . 


The equations of motion of the vehicle in the gravitational field of a single 
body subject to thrust are as follows: 


R = --U— + - T 


k 
m — 


( 1 ) 


m (tg) = m (t A ) + J*® m 


dt 


( 2 ) 


where m = - - and T is a unit vector parallel to the direction of thrust. 

The optimum decision functions are determined with the help of the 
Lagrange multipliers, X , X , and a which satisfy the following equations 

V _ u x 3 U (X R) R 

X 


(3) 


^ . 

a (tg) = <r(t A ) + J t o dt 


(4) 
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where 



The thrusting program is determined by the sign of the switching function 
S, which is given by 



> 0 
< 0 


k = k 


max 


k = k 


mm 


( 5 ) 


The direction of the unit thrust vector T is given by the direction of the 
Lagrange multiplier X 



The natural end point if reached when 

o (t p ) = 1 ( 7 > 

The problem is to generate a set of initial values of the Lagrange multi- 
pliers such that an optimum orbit can be computed, where the natural end 
point matches the desired end point. This is accomplished by obtaining a first 
estimate of the initial values and improving these by using a differential correc- 
tion scheme. 

One of the requirements necessary for a rapid convergence of the differ- 
ential correction scheme is that the first estimate of the initial values of the 
Lagrange multipliers be reasonably close. The following is a method for ob- 
taining a first crude estimate of the initial values of the Lagrange multipliers. 
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INITIAL VALUES OF LAGRANGE MULTIPLIERS 


First Method 

A first estimate for the initial values of the Lagrange multipliers can be 
obtained by making the following assumptions about the trajectory. 


(a) Two burning periods are required to accomplish the optimum tra- 
jectory, one occurring in the time interval t to t^ and the other 
in the time interval 1 2 to t^. During the time interval t^ to t g the 
vehicle is in a coasting region. 

(b) The time intervals in the thrust regions are so small that A V (t Q ) 
and AV(tp) are obtained by solving the "two-impulse orbit transfer" 
problem, where 


AV(t 0 ) = Vftp) - V(t Q ) 


AV(t f ) = V(t f ) - V(t 2 ) 


( 8 ) 


(c) In the regions of thrust the gravitational force may be neglected. 


If in addition we assume that the thrust direction is fixed the differential 
equations for the state variables and the Lagrange multipliers, within the burning 
region reduce to 



X = 0 


where 


t 


cr (t) = cr(t A ) + f. ordt 

A .t A 


CT = - 


cihX 


m 


( 9 ) 

( 10 ) 

( 11 ) 


( 12 ) 
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and 


m (t) = m(t A ) + (t-t A ) m 


(13) 


In the burning regions the thrust vector is in the direction of AV. Therefore 
from Eq. (6) we have 

AV 

(14) 

In the coasting region, m and a are constant. Thus, it follows that 

cr^) - cr(t 2 ) (15) 


m^) = m(t 2 ) 


(16) 


For the computations of the initial values of the Lagrange multipliers, one 
proceeds as follows: 

First Eqs. (9) and (10) are integrated in the two burning regions t to 
and t 0 to t f , resulting in 

AV o 


m(t 1 ) = m(t Q ) e c 

(AV o +AV f ) 

(17) 

m(t f ) = m(t Q ) e c 

(18) 

^(t^) = ^(t Q ) = constant 

(19) 

X(t 2 ) = X(t^) = constant 

(20) 

^(t x ) = >L(t 0 ) +(t 1 -t Q )X(t o ) 

(21) 

* (t f ) = X (t 2 ) + (t f - 1 2 ) X (t 2 ) 

(22) 


where the time spent in the two burning regions is computed by using Eqs. (13), 
(16), (17), and (18), and is given by 
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AV. 


< t i-V = 


< t rV = 


m(t o )(e 


- 1 ) 


m 


m (t Q ) e c (e e 


(23) 


1 ) 


m 


(24) 


From the assumption that the thrust direction is fixed during each burning 
interval it is evident that X and X are in the same direction. Therefore only the 
magnitude of X and X need be considered, i. e. X and X' . 

At the transition times t^ and t 2 the switching function must be zero. Thus, 

X(t x ) (X (tj) 


and 


m^) c 


X(t 2 ) a(t 2 ) 

m(t 2 ) - ~c 


(25) 


(26) 


It can be shown that by integrating Eq. (11) in the two burning regions and 
m aking use of Eqs. (12) through (26) one forms the following three independent 
equations with five unknowns, i.e., CT(t Q ), X (t Q ), X*(t o ), X (t^) and X (t^) 


AV. 


m(t o ) - - m 

x*(t o ) = 


AVf 

m(t 0 ) X <V + 5 C 1 

-e c 


(27) 


AV o AV f 


X(tJ 


AV, 


+ ^ [c(l . e c >AV f ]- 1 


(28) 
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e 


AV. 


m <y 


[x ( t 0 )-X(t f )] + 4-(l-e 



(29) 


• • 

By making use of the transversality condition X/R-X • V + orh = 0 at times 

t and t r one can obtain two more equations, 
o t 


-X(t o ) 


V(t 0 )-Av 
AV„ 


cm 


m(t Q ) '-o 


X(t) + a (t ) m = 0 


-X(U 


Y(t f )- AV f 


AV +AV, 

o t 


AV, 


cm „ c 

e 


m(t o ) 


X (tf) + m = 0 


(30) 


(31) 


Eqs. (27) through (31) constitute five equations with five unknowns. The 
solution of this system of equations is given by 


(AV +AV, 

miy V— 

c e 

AV o 

(32) 

x (t 0 ) = 0 


(33) 

( AV C + AV f ) 



or(t Q ) = e c 


(34) 

(AV +AV-) 

m(t o ) " c 

c e 

AVf 

AVf 

(35) 

IX 

II 

o 


(36) 


It is of interest to note that the magnitudes of X at the initial and final times 
are equal and directly proportional to the mass at the final time. In addition, the 
value of cr is also proportional to the final mass and may be expressed as 


a(t) = 


m(t f ) 

m (t) 


(37) 


252 


Second Method 


An approach for obtaining a better first approximation is to remove or at 
least "relax" some of the assumptions made in the first method. More specifi- 
cally, instead of completely neglecting the gravitational force in the regions of 
thrust it can be assumed that the gravitational force has a constant value of 

— g- in the first region and — g— in the second region. 

r r* 

o f 

In addition, we assume that the direction of the total acceleration in the 
two regions of thrust is parallel to the vector AV^ and AV f , respectively. This 
implies that the direction of the thrust is not fixed. 


It is clear that in the region of thrust the vector X lies in the plane formed 
by the vectors R and AV . It is most convenient to resolve X into components 
along the vector R and normal to it. These two components are designated as 
X^ and X^ , respectively. 


The differential equation for X can now be written as 




t 


(38) 


X 

y ^ y 


(39) 


The solution to Eqs. (38) and (39) is £iven by 


X^(t o ) cosh 


/IjT V* o )sinh 


(40) 

X (t )cos . 
Tj' o' \ 


lJ jr <‘o> sin J 

SL t 

3 1 
r 

(41) 


Since the intervals of thrust are assumed to be of short duration it is per- 
missible to approximate Eqs. (40) and (41) in the regions of thrust by neglecting 
the second order terms of a Taylor series expansion, i.e. , 
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t ^tst, 
o 1 


( 42 ) 


X(t)« X(t Q ) +(t-t o )X*(t Q ) 
X(t)«x (t 2 ) +(t-t 2 )X* (t 2 ) 


St =£t f 


(43) 


Similarly, one can approximate X’ in the regions of thrust to the same order of 
accuracy. 


X ? <t)~^<t-t 0 )X € (t 0 ) + X s <t 0 ) 

(44) 

t ^t^t. 

O 1 



(45) 

X" ^ (t) ~ ^ (t - tg) X^ (tg) + (tg) 

(46) 

t 2 ^ tgt f 


\ W « - -*3 <* - ty \ + \ 

(47) 


The procedure for obtaining the initial values of the Lagrange multipliers 
is now the same as in the first method except that Eqs. (19) through (22) are now 
replaced by Eqs. (42) through (47). 


CONCLUSION 

A set of approximate initial values of the Lagrange multipliers have been 
derived. In addition, a method for obtaining a better first approximation has 
been outlined. It should be pointed out, however, that as one attempts to obtain 
these improved first approximations in the manner outlined, the algebraic mani- 
pulation of the expressions involved become more cumbersome and additional 
approximations may be needed. 
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SUMMARY 


This report is submitted in partial fulfillment of the contract in "Space 
Flight and Guidance Theory, " No. NAS8- 11040. It presents a discussion of 
Lagerstrom and Kevorkian's two-variable expansion method for the compu- 
tation of lunar trajectories. Section 2 discusses the general background of 
the method in terms of singular perturbation theory. Section 3 discusses 
the major steps in the development of a uniformly valid solution for earth- 
moon trajectories and Section 4 presents a slightly different approach to the 
same problem. 
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1. INTRODUCTION 


In refs. (3) and (4) a new method was suggested by Lagerstrom and 
Kevorkian for the computation of lunar trajectories. The method was similar 
to one which had been used successfully in a number of singular perturbation 
problems of boundary layer theory (refs. 1, 2). The result of approaching 
the lunar trajectory problem as a singular perturbation problem was a uni- 
formly valid solution (i. e. valid everywhere in the earth-moon space) to 
first order in the parameter for a certain class of trajectories. The class 
of trajectories is that which starts in a neighborhood of order near the 

earth and arrives near the moon to within a neighborhood of order . 

Similarly to other singular perturbation problems, this uniformly valid 
solution was obtained by formulating two solutions, one valid near the earth 
(the "outer solution") and the other valid near the moon (the "inner solution"). 
The inner solution is expressed in terms of "blown up" variables. The outer 
and inner solutions are left undetermined by introducing a number of con- 
stants; these constants are determined such that the singularities in the 
outer and inner solutions cancel when they are combined to form the "com- 
posite solution. " 

The basic idea of the method was worked out in its application to the 
two-fixed center problem with special initial conditions (ref. 3); then the 
same technique was used in the restricted three body problem with more 
general initial conditions (ref. 4). One of the most interesting results was 
the finding that the outer solution must contain a part which is proportional 


260 


, or else it cannot be matched to the inner 


to the small parameter 
solution; the outer solution can thus be interpreted as an earth centered 
Kepler ellipse with a first order correction to take care of the moon's per- 
turbation, In comparing this method with the usual way of ’’patching conics”, 
it was thus stated that a patched conic method could not be accurate, unless 
the geocentric ellipse were corrected for the moon's perturbation. The two- 
variable expansion method was thus offered as an improvement over patched 
conic methods and it appeared to be (at least initially) equally practical. 

This report presents an explanation of the method (in Section 3), based 
mostly on ref. 4, and the beginning of a somewhat different approach (in 
Section 4). The claim that this report is an "explanation” is made with all 
modesty; it is an explanation in the sense that it presents and discusses the 
major steps of the developments in ref. 4, leaving out many of the laborious 
details. In this way it is hoped that the reader may gain a full appreciation 
and understanding of this very interesting method; this report may thus serve 
as an introduction to the reading of refs. 3 and 4. This explanation is pre- 
ceded (Section 2) by a general discussion of singular perturbation theory, 
based mostly on ref. 1 and 2. In particular with respect to this section, 
and the conjecture and theorem on which the discussion is based, the authors 
gratefully acknowledge personal communication with Dr. Kevorkian. 

In Section 4 the beginning of a slightly different approach to the same 
problem is presented. Whereas the work by Lagerstrom and Kevorkian is 
formulated in inertial coordinates, this new approach makes use of rotating 
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coordinates, and the Jacobi Integral in order to solve the problem as a third 
order system of differential equations. 


2. DISCUSSION OF THE TWO- VARIABLE EXPANSION METHOD 


The method used by Lagerstrom and'Kevorkian to formulate a uniformly 
valid representation of earth-moon trajectories is that which is used in the 
singular perturbation problems of boundary layer theory. A singular per- 
turbation problem may be characterized as follows: a differential equation 

/ ( Xj tt } c) ^ O and boundary conditions y — O depend on a 

small positive parameter £ in such a way that the order or type of / 

change when ^ = o , while the number of boundary conditions remains unchanged. 
Thus, if represents the solution of f Xj & y oJ — 0 > one may not 

expect that approaches uniformly as £ — ^ 

Fundamental to the solution of singular perturbation problems is the 
introduction of certain limits. Consider functions ^ of £_ , positive and 
continuous in 0 >< ^ and tending to a definite limit as £ — ^ 0\ intro- 
duce a new variable , then a limit on /~ f X y Bj is defined 

as 

e .~l , fixed and ^ O • 

f L T J J T 

If ^== /» the limit is usually called "outer limit," and X the "outer 
variable", since in the boundary layer problem which motivated this formu- 
lation this limit presents a satisfactory approximation in the physical space 
away from the boundary. An "inner variable" and "inner limit" are obtained 
in many problems by putting £ ; the inner limit is an approximation in 
that region of the physical space where the differential equation changes 
order (or type) as <£ - — ^ 0. As the inner variable is kept constant, the 
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physical variable X tends to ^ as £ 5* O ; it is as if the problem is 

discussed in terms of "stretched" or "blown-up" variables. Theoretically 
of great importance are also the concepts of "intermediate variable" and 
"intermediate limit, " which are intuitively understood as obtained by a function 
J> where the order of magnitude is in between and Of ~L) . 

A more rigorous discussion is given by Kaplun in ref. 1. 

The formulation of a solution based on inner and outer limit is based 
on a "matching” of the two limits. But since there is no a-priori reason 
why the regions of validity of inner and outer limits should overlap, it may 
seem to be surprising that this has been so successful in many problems. 

It is here that Lagerstrom and Kaplun have contributed greatly to the under- 
standing of the problem by using the intermediate expansion to bridge the 
gap. In ref. 5 Erdelyi discusses this in some more detail, but (as here) also 
in an intuitive manner. 

The method by which a uniformly valid solution of singular perturbations 
is obtained is based on a conjecture and a theorem. The conjecture is: the 

solution of the limiting differential equation (obtained by subjecting the 
differential equation to the above defined limiting process) is identical with 
the limiting approximation of the exact solution. Thus, if an exact solution 
cannot be obtained directly, one can get an approximation (actually an 
asymptotic expansion) by solving the limiting differential equation. The 
validity of this conjecture is supported by a number of problems to which 
exact solutions are available. 
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In a singular perturbation problem it will be necessary to combine at 
least two limiting solutions (i. e. inner and outer) to obtain a uniformly valid 
solution, that is a solution valid in the entire physical space of the variables. 


Kaplun's extension theorem bridges the gap which may exist between the 
regions of validity of the limiting solutions. The formulation of the exten- 
sion theorem requires the definition of "equivalence classes". Let^/ and ^ 
be functions of £ , positive and continuous and tending to a definite limit 


as <£> — ^ 0 , then 

if 


(*£ ) belong to the same equivalence class 


0 


A partial ordering of equivalence classes is defined by 

A set -S of equivalence classes is convex if, for every ord^/ and ord ^ 
in 5, ord ^ X ord £i, ^ ord implies ord ^ is in S . Open and 
closed convex sets of equivalence classes are defined according to the usual 
definitions of set theory. The extension theorem may now be formulated as: 
If an approximation is valid to order £, in a closed set S its domain 
of validity may be extended to an open convex set S , containing S . 

Thus, the inner and outer expansions are valid in larger regions than 
those for which they were derived. The regions of validity of inner and outer 
expansions may now overlap or else they may be joined by an intermediate 
expansion. Whether the inner and outer expansions are matched directly 
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or by the use of an intermediate expansion, the matching is performed by 
using overlapping regions of validity provided by the extension theorem. 

It will be seen that in the earth- moon trajectory problem the matching can 
be performed directly without the use of an intermediate expansion. 

The following illustration may be of some help in understanding the 
meaning of the expansion theorem. In figure 1 the shaded areas in the X } £ 
space indicate the regions of validity of inner and outer expansions in a 
problem with singularity at X — O . 



Fig. 1 EXTENSION THEOREM 

The outer expansion is valid for a range of X bounded away from zero. 
The region for the inner expansion shows the typical behavior near the 
singularity: As £ tends to zero the physical variable X tends to zero 

also; the inner variable ^ remains finite. It is clear that for 

small S- the regions of validity of inner and outer expansions do not over 
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lap. But the expansion theorem provides for small additional regions of 


validity, indicated by the dashed lines in fig. 1. These regions can now be 


used to provide overlap with an intermediate expansion (obtained by intro- 

X 


ducing the intermediate variable = 


ffe) 


, ord £ 


ord / ) 


and matching can be performed. 

The plan for formulating a uniformly valid solution of a singular per- 
turbation problem is now clear. An outer solution of the differential equation 
is obtained, satisfying some of the boundary conditions. Typically, the 
boundary conditions near the singularity are neglected, but the outer solution 
must have as many arbitrary constants as there are neglected boundary 
conditions. Next, the problem is "blown up” in the region near the singularity 
by the transformation to inner variables. The boundary conditions which 
were neglected in the outer solution can now be satisfied by the inner solution, 
but the other boundary conditions will in general not make sense. Therefore 
the inner solution is partly indeterminate. To remove this indeterminacy 
the inner and outer solutions are ’’matched” as follows. The outer solution 
is evaluated at the inner region, the inner solution is evaluated at the outer 
region and these two functions are equated after the introduction of the trans- 
formation X = *7 — . Finally, a "composite solution” is obtained by 

f t(z) 

adding the inner and outer solutions and subtracting either the inner solution 
evaluated at the outer region or the outer solution evaluated at the inner 
region. This either /or condition reflects of course just the matching con- 
dition. The matching and the formulation of a composite solution described 


267 


here is possible when the regions of validity of inner and outer expansions 
overlap, if this is not the case the same procedures have to be followed on 
either side of an intermediate expansion. 

The extension theorem is the basis for success in matching; the con- 
jecture makes it plausible that the composite solution is uniformly valid, 
even though the inner and outer solutions themselves are only valid in their 
respective regions. 

The application of these principles to the earth-moon trajectory problem 
takes the following form. The equations of motion of the planar restricted 
three body problem (in non-rotating coordinates) are formulated with one of 
the coordinates, X , as the independent variable. Uniformly valid expres- 
sions are sought for the time and the other coordinate as functions of X and 
the small parameter , the earth-moon mass ratio. Near the earth the 
influence of the moon is seen in the equations of motion as a perturbation 
(proportional to ) of the Kepler equations. Clearly, in this problem the 
singularity is located at X" — 9* / , since near the moon the attraction of 

the moon itself is the major force. An outer solution is formulated in the 
physical variables X > tL and y 5 it describes the earth- centered part 
of the trajectory. An inner solution is formulated in the "blown-up" variables 
X , ^ and y , the differential equations for which show the moon's 

attraction as the major force. In principle the outer and inner solutions are 
asymptotic expansions of which the separate terms can be obtained by sub- 
stituting t = r[ o i- h, -h ^ // f H ' ' ' " 
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in the equations of motion, ordering the results according to powers of y(A^ 
and solving the equations for t c ^ y c ; h / j Y / , / in succession. A 

major result of Lager strom's and Kevorkian's investigation was the finding 
that, in order to formulate a first order solution, the outer solution must 
contain the correction of order ytu to the earth-centered Kepler trajectory. 
The reason is that the angular momentum near the moon (for a passage at 
distance of order y^ ) is of order y^ , and can thus only be defined when 
terms of order are included in the approach trajectory. The matching 

of inner and outer solutions is performed by equating term by term the results 
of evaluating the outer solution at X - / and the inner at X = - ; for 

this purpose the inner as well as the outer solution are expressed in the 
inner variable. The results of the matching are the elements of the moon- 
centered hyperbola and the phase constant of the moon. The composite 
solution is obtained by adding the inner and outer solutions and subtracting 
the outer expansion of the inner solution. From the form of inner and outer 
solutions it is clear that no intermediate solution is required. 

In their first paper on the three-body problem (ref. 8) Lagerstrom 
and Kevorkian treated the problem of two fixed force centers (the Euler 
problem). They discussed trajectories which leave from the center of the 
larger mass, the Kepler part of the outbound trajectory being a straight 
line^. The major result was that 1) a uniformly valid solution to order y<^ 
could indeed be obtained and 2) the outer solution must contain a correction 
of order^ in order to be able to determine the constants of the inner 
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solution. Because of the very special initial conditions the outer and inner 
expansions are of simple form and therefore the principles of the method are 
clearly demonstrated. In their second paper (ref. 9) they treated the more 
practical restricted three body problem with arbitrary initial conditions 
(although restricted to a neighborhood of order yjc near the earth). While 
following the same method in principle, the details of the analysis are some- 
what obscured by the added difficulties from the more general initial condi- 
tions and the motion of the moon. The following section refers in particular 
to this paper; it interprets and explains the method by lifting out the essential 
difficulties and omitting all easily understood details. References 10 and 11 
discuss some numerical aspects. 

The following section contains an outline and discussion of ref. 9. It 
is hoped that, by concentrating on the major difficulties, that section, together 
with the general discussion in this section, will be useful for the better under- 
standing and appreciation of the very interesting method of Lager strom and 
Kevorkian. 
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3. TWO- VARIABLE EXPANSION METHOD FOR EARTH- MOON 
TRAJECTORIES 


3. 1 Equations of Motion; Outer and Inner Variables 

In geocentric, non-dimensional, inertial coordinates A , y the equa 
tions of motion for the planar restricted three body problem are: 


X 


X+O-A*’) r 3 


-x 


“ ’fyyj 


where 


y f (t-tffa = M-fr Ss. -f - n 


>n. 


( 1 ) 


<T u f-3. 

ylL is the earth- moon mass ratio, /£<- = "77" , and the coordinates 

'*£ t '' m 

of the moon are 




( 2 ) 


T is a phase angle which is to be determined later. 

The goal is to formulate uniformly valid expressions (i. e. valid near 
the earth as well as near the moon) to order yU— for trajectories which 
leave from a neighborhood of order ,c<— near the earth and reach a neighbor- 
hood of order yOL, near the moon. 

The outer variables, to be used for the outbound trajectory near the 
earth, are the physical variables X ; s' ) ~£. . The inner variables will be 

chosen as 

CT _ X- (± -T) 

■> 


- _ X _ cV.i (t -T) rr y- ^ (t- -TD 

A — t" o / — " ~7TZ 


_ (t-T)-T 


( 3 ) 
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This choice assures that the motion near the moon is Keplerian up to 
and including the first order of and that the velocity far from the moon 

is of the same order (i. e. of order 1) as the velocity far from the earth. 

The additional phase angle 7T is introduced so that ~t- can be made to 
vanish at perilune. 

- 1/3 

It is interesting to note that if a scale factor of is used in the 

definition of )( and y and the time is left unsealed, the equations of 

motion in terms of X , y and i after letting => 0 are the Hill 

equations; these equations are valid in Hill's region, i. e. a region of order 

1/3 

near the moon. If an intermediate solution were required, these equa- 
tions could provide it. It will be seen that the inner and outer solutions can 
be matched without using an intermediate solution, although this cannot be 
expected a priori. Apparently, for the class of trajectories considered 
here (i. e. coming from a neighborhood of order near the moon), the 

passage through Hill's region is so fast that Hill's equations do not need to 
be considered. 

It will be convenient to introduce the coordinate X as the independent 
variable; the matching of inner and outer solutions is then done on the basis 
of distance instead of time. The equations of motion in the outer variables 
are then ^// 


1 h3 - / 

V* / & / 

— — --A" ^ ^ l—M') 

/ /z i /a r* <r 


( 4 ) 
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The equations of motion in inner variables, valid near the moon, are 
Keplerian up to and including the first order of and do not have to be 
written here. Terms proportional to the first power of ytC are not present 
because of the scaling of the variables and because the moon centered X , 
\J axes are taken parallel to the earth centered X , V axes. 


3. 2 Outer Expansion 

The right hand sides of equs. (4) represent small perturbations due to 
the moon; near the earth the solution of equ. (4) is thus nearly Keplerian and 
it will be convenient to specify the initial conditions of the trajectory by giving 
the values of the Kepler integrals. The integrals to be chosen are the total 
energy , the angular momentum the location of perigee and the 

time of perigee passage. In order to reach the neighborhood of the moon, 
the total energy must be & (l ) ; the initial velocity is thus and, 

since the trajectory leaves from a neighborhood of order yti near the earth, 
the angular momentum is 0 . Without loss of generality the perigee 

may be taken to be on the x-axis (on the side of the earth opposite to that of 
the moon). The initial conditions are thus 


at 


X = 0 : 

M* A 


(5) 

(6) 


perigee on x-axis 

and the time is specified by requiring that the Keplerian approximation is 


exact at X ^ O to all orders of 
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Since the angular momentum is of order ^- 1/2 it is clear that, for the 
class of trajectories discussed here, y is also of order ./C-- • The 

asymptotic expansions for t and \j may thus be taken to be 

t (t, X,A) ±,(X) + ■■■■ 


y i x } a) =/■ ^ ( x, a) -h A'Y, (x)+ 


and 


(7) 

( 8 ) 


The differential equations for , t f , j ^ and y f are found by 

substituting (7) and (8) into the equations of motion (4) and by ordering the 
results according to powers of • The equations for ~b Q and y_£ are 

of course just the Keplerian equations (equ. 4 with zero in the right handsides) 
and their solutions do not have to be repeated here. However, one detail 
must be pointed out. Whenever the parameter ^ appears as ( / - 
the nondime nsional gravitational constant, it is not subjected to the limit 
process. Furthermore, the angular momentum constant has been written 
as \ and for these two reasons the parameter appears thus in 

the expressions for the Keplerian part of the trajectory. This seems at first 
to be in contradiction with the principle of the singular perturbation method 
according to which the zero-order solution would be independent of the 
small parameter. Allowing the small parameter to appear in the Keplerian 
part results in somewhat more convenient expressions. The first part of 
~L 6< now wr itten as 


i c ( *>/ l ) = 4 , (*) ^4 M 


(9) 
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If the solution had been started with (t) * according to a strict 

application of the limit process it would be necessary to consider a separate 
"boundary layer" near the earth, because the relative orders of magnitude 
of the terms in 7 L ( X, /o) are different for X = &0) and X - D(/*-) • 

This nonuniformity has nothing to do with the moon's perturbation and is 


taken care of by letting appear in the Kepler ian solution. 

The equations for the first order corrections ~L f and y ( are: 

t, . 3 4 * 1, 


i *-00 

// 3 T //y 

cc 

, ✓/ 


“ /e 


( 10 ) 


l 


x! _ 4, y,' , x _ 


/<2 
'00 

With 

Because the initial conditions have been chosen such that the Kepler solution 
is exactly valid at X = £> . the initial conditions for ^ and y, are 
simply 

(o)= Yf (°) = 0 and ^ VW and y'fo) . 




00 


h 


(ii) 


3. 3 


and 


First Order Corrections in Outer Expansion 
The first order integrals of (10) and (11) are easily obtained as 



( 12 ) 


(13) 
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In principle ~L f and are thus obtained by quadratures but so far 

no analytic expressions for t: / and have been found. The functions ^ 


and 


1 ° 


are unbounded for X > I and their behavior near X — l can 


be studied by expressing the several parts of and ^ in Taylor series 
near X — I . The results are 


f M-= 




+ 


(14) 


_ -h P(\) < 15 > 

OtUr)%-[ 0-x) 2 - ' (i~~ x ) J 

where ^ ii) and fix) are the regular parts of l and {j ^ , 


and 


U = iX? Jfaj = T7-T) 

<~ C0 ( / 


(16) 


which is the X velocity of the Keplerian trajectory at X = / • 

Using (14) and (15) the first order corrections to the Keplerian part 


of the outer expansion may be written as 

sX 


fal c«>[%- 'tegywfy 


Since (at least to this time) no analytic solutions for k f and y f have 
been found, the complete trajectory can only be computed by evaluating the 
quadratures numerically. Clearly, this causes numerical difficulties be- 
cause of the singular behavior near X = / • It is of some help in establish- 
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ing a computer program based on this method that £ / (X) and y (*) depend 
on only one parameter, namely the total energy - /° Z . The corrections 
could thus be computed and tabularized once and for all. Also, near / 

and may be expressed much more simply as 

t, = Vy o-x) + f 00) 
y, = (n-^ z ) + ^(P) + 00) 

where and </~ are functions of the total energy alone. Unfortunately, 

y and gf become unbounded as > /, that is for the minimum 

energy trajectories. This difficulty has been treated in detail in ref. 6. 

Equ. (19) and (20), and particularly the functions Y and oT » play 
an important role in the matching of outer and inner expansions. 


(19) 

( 20 ) 


3. 4 The Inner Expansion 

The equations in the inner variables X , \j and i. have ylc* only 
to the second and higher powers. Since the present purpose is to develop 
a solution to first order in yUs the moon centered part of the trajectory is 
thus Keplerian and, in particular, hyperbolic. It will be convenient to 
characterize this hyperbola by the four constants 

U,j , the X component of velocity at X — — <=*= 

V / , the y component of velocity at X = — 

K = Ay-BU, related the direction, of the asymptote 

r ' ' *T~ 

and ■£ — 0 at perilune. 
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In the definition of f] f , 

/} __ a. B &*=■ & & 

CL is the semimajor axis, if the eccentricity and 6? is the counter- 
clockwise angle between the X axis and the apse line of the hyperbola. The 
expressions y ( X ) and £ (x) do not have to be given here (since they are 
Kepler ian) except as they are needed for the matching of inner and outer 
solutions. For this purpose their values as X — ^ - <^are needed. These 

are 



t = 


U 

La 


X - dL-BUj 


CL, 


f U a^ 2 - Li 


— AX 


( 21 ) 


( 22 ) 


as follows readily from the equations of hyperbolic motion (most conveniently 
by letting the eccentric anomaly approach- "°)« 

3. 5 Matching of Inner and Outer Solutions 

The purpose of matching the inner and outer expansions is to determine 

the constants of the moon, centered hyperbola, thereby also relating the 
singularities in the two expansions in such a way that they cancel each other 
in the composite solution. Because the singularities are logarithmic in 
nature in the inner as well as the outer solutions, such matching can appar- 
ently be achieved without the use of an intermediate expansion. 

The geometry of the matching is illustrated in Fig. 2, as much as it 
can be illustrated. The part of the figure related to the inner expansion is 
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drawn in the scaled coordinates ( X ; t j / and must be thought as infinitely 
small in comparison with the figure for the outer expansion. It may be 
remarked that this matching is strictly analytical, whereas the "patching” 
of conics is strictly geometrical. A direct comparison of the two methods 
is therefore difficult; such comparison should be based on the final numerical 
results. 



£ - i= tCi M +■ A- (X) /t-t ( (x) = T -h yU-h + X 

(x) + y,m y 

Figure 2 GEOMETRY OF MATCHING 

The matching is performed by evaluating the outer expansion at x = I 
and equating the result term by term to the inner expansion evaluated at 
X * — oo f both expansions being expressed in the inner variable. (The 
important thing is that both expansions are expressed in the same variable; 
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the present choice of inner variable is simply for the sake of convenience.) 

The part of the outer expansion identified with is evaluated near 

X — / by writing two terms of its Taylor expansion at \ = I : 


4 (*) = L (l} + tcc (,) 


using equ. (16). The inner variable is introduced by X — J-'- X / ^ ( t ~ 1~ ) 
from equ. (3), and if it is assumed that (t ~ T ) is small (as it will be 
shown to be), there results 


ufzic'-MU-Tya--'} <“> 

The introduction of the inner variable into the expression for 
(equ. 19) is taken care of by putting (l~X) - X , the term cos Ct ~ T) 

in equ. (3) being put equal to unity with enough accuracy since C, is multi- 
plied by . By combining equs. (7), (9), (19) and (23), the outer expan- 

sion evaluated near X=l and expressed in the inner variable is thus 


tm — ^ ec 0) x' Ll j/ 4 " 4 4 

f/X I) +0+iL x )*[Aj(-*)-b£yA ] / yjf {24) 

From equ. (3) and (22) follows for the inner expansion evaluated at X =«- - ->=> 



Now, if the phase angle T is chosen to be composed of several parts 


according to powers of 


A 


as follows, 
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(26) 


r- t c 


the third term in equ. (24) is to first order in Ay* , loTf, 




and the two expressions for 


if?) 


can be made identical by making the 


following choices for X , X, andT. 


/, 


= i.JO 


cc 


U. = £ 


r= - a 


/<! 


-^2., 


-dfuxr^. r(f) 


[x +i \~t-L- £"0)- 


(27) 


(28) 


(29) 


Note that this implies that ^ j ; this will be confirmed 

by the matching of the expansions for \j 

From equ. (8), (20) and (3) follow for the outer expansion of y 
evaluated near y = I and expressed in the inner variable , 


Z (Jhj' (-X) + c 



(30) 


Since the Keplerian part of this expression is multiplied with 9 its 

value near X ■= / is obtained simply by substituting / ; no Taylor expan- 
sion need be used here because the second term would be proportional to 

^3/2. 

From equ. (3) and the expression for the inner expansion, equ. (21) 
follows for the inner expansion evaluated near X ~ — 
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yft) -/'-[ % x - K] + 

where it is again assumed that ( i~T ) is small, so that /2CYV (i-~ ~7~) ~ (i ~T~) 
This assumption is shown to be valid (at least to order ) by equ. (25) and 
(29). If then the expression for £(x) near X= I (equ. (25)), and the evalu- 
ation of X (equ. (29)), which followed from the matching of the expressions 
for the time, are used, there follows for y (x) , 

-y (Y.) — - jjT X _ K t -t x U- + a. M-j (-% ) 

t 0)-^us' + T (P)-1, -] ( 31 ) 

The expressions (30) and (31) are made identical by the following choice 
of the constants ~f ^ and T , 


K=-/ 

- k, u i o) +j + St a 0 )- y(p) 


(32) 

(33) 

(34) 


~Tj is arbitrary 

The result V ■= —I confirms the expression CL ~ j ' t' llj J which 




was necessary for the time-matching since for the moon centered hyperbola 
U _ 2 • With equs. (28), (33) and (34) the moon 


l / 

centered hyperbola is now determined, the constants ,t} , X » 
being expressed as 


r 


and (j 


-y 


282 


■Si -- /fU* = 3-2 />" 

1 = If, u, 

\-'A 


P = U* +■ ( h-U. *) 

f = K, u r U t f/tu?) 


( 35 ) 


-A 


These four constants are really equivalent to three integrals because 

t L ! 

so that a fourth integral is still needed. This is provided by the condition 

that "t =*' 0 at perilune. This condition is satisfied by the proper choice of 

the phase angle ~T and the origin of the inner variable i- which are 

determined by equ. (27) for ~J~ , equ. (32) for ”77 , and equ. (29) for 2T . 

c ^ 

The constant A which is needed in equ. (29) is simply 

A = -JLr 

JtJi 

It is a fortunate circumstance that J , the part of the phase angle / which 
is proportional to yCL- , is ‘arbitrary. ~T^ influences ^ , and thereby the 

angular momentum l -KU,. With the hyperbola's total energy deter- 
mined by U- t , the perilune distance can thus be adjusted by changing the 
angular momentum through ~T . 

It may now be noted that the phase angle ~J~ (apart from the arbitrary 

contribution yU-~T t ) and two of the hyperbolic constants depend only on the 

Keplerian part of the outbound trajectory. As a matter of fact, Lagerstrom 

and Kevorkian derived T and 77 in the very beginning of their analysis 

c 
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and on the basis of the outbound Kepler trajectory alone. For the purpose 
of this presentation of their analysis it was felt that the modification in which 
~T and T, are derived from the matching conditions is a little more in 

0 -t 

line with the general principle of the method of singular perturbations; this 
principle being the determination of certain constants, which leave the inner 


and outer expansions indeterminate, from matching conditions. 

Furthermore, it is noted that the first order corrections of the outbound 
trajectory enter into the matching conditions only through the functions J f/°) 
(in the determination of T ) and ( rf (/°) ~ J ( f') ) ( in the determination of 

). The functions y and become unbounded as — =► I , i. e. for 

minimum energy trajectories, but the difference was shown to be 

finite (ref. 11). The difference fj ma Y be interpreted as the correction 

0 £ 9 required if were determined on the basis of the Keplerian 

trajectory alone. Since ^ is the y -intercept of the approach asymptote 
of the moon-centered hyperbola at X = l > it has been claimed that ( <( *V) 


is a measure of the error made in the usual methods of patched- conic 
computations; ( £- |)is then simply the miss-distance of the approach 
trajectory. Because of the basis difference in the two methods (which has 
been pointed out earlier in this report: patching conics is geometric, while 

matching inner and outer expansions is analytic) a comparison on the basis 
of ( 'J ~~ jjf ) tends to come out unfair for the patched- conic method. It 
would be interesting to see how the corrections ~L ^ and y contribute to the 
outbound trajectory near its intersection with the moon's sphere of influence 
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And if a thorough comparison of the two methods were desired, it should of 
course be based on final numerical results for representative trajectories 
computed by both methods. Lagerstrom and Kevorkian themselves have not 
provided such a comparison, except by pointing out that ( J - y) is a measure 
of the patched conic error; in ref. 5 there are comparisons with exact (i. e. 
numerically integrated) trajectories, but whether or not the results say much 
for the two-variable expansion method depends mostly on what kind of errors 
one is willing to except. 

3. 6 The Composite Solution 

The outer and inner solutions have been formulated and their singular 
behavior has been identified. By matching these two solutions in their over- 
lapping region of validity the phase angle and the constants of the moon 
centered hyperbola have been determined. To complete the work a composite 
solution must be formulated. According to the singular perturbation theory 
the composite solution is obtained by adding the outer and inner solutions 
and subtracting their common part. That common part is just the inner 
solution evaluated in the outer region (that is for at y — c*?), or the 

outer solution evaluated in the inner region (that is for X ^ | ); these 

two evaluations are identical because that was just the condition for matching. 
fiere it is convenient to use the inner solution evaluated for X >• — c>e>. 

According to equ. (8) the outer solution is 

'/ (t } MS) (\ M) f Jst y f ( x ) 

where and ^ are known functions, y exhibiting a singularity for x — > / . 


285 



According to equ. (3) the inner solution is 
where the moon coordinate ( £ —~T J ^ X ■== 


’-hi 

. (m _ ' ' 

with ^ = Cta (t — 7y and y ( x) is the equation of the moon-centered 

hyperbola. According to equ. (3) and (21), the inner solution evaluated for 
X > — is 


y ( x,a). = £. /■ ^ 

/ 7 L tr? 


with — ; / rz \ Y* Y — 

Gf ( X ; = 77^ X 






The composite solution for V is thus 

y ( , x / ,y =-/£*y^ ( x , a ) is /> ( *) + /l [ y ^ 

and in the same way the composite solution for ( X lu) is found to be 


-L(* A) = 4 ^ X y *A A M ^ M , 



(36) 


where 


/3 C x . ) = 





(37) 


If analytical expressions for ^ (X) and ^ (V) were available it would 
be observed that their singularities are cancelled by the singularities of the 
expressions in square brackets; this is for instance the case in the analysis 
for the two-fixed center problem (ref. 3). In the absence of analytic expres- 
sions for and y , the singularities must cancel numerically. Now, to 
determine just the geometry of the moon centered hyperbola (determined by 
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themselves are not 


the constants in equ. (35), the functions and 

needed, only the function (</" ^ ) is, U-r) depends on the initial con- 
dition — /^only and can be computed and tabulated once and for all. However, 
if the time-dependency and the entire trajectory is needed, the functions 
and y ( , as well as the expressions in the square brackets of equs. (36) and 
(37) must be computed and their singularities made to cancel numerically; 
this may be expected to cause some numerical difficulties. 


287 



4. THE OUTER EXPANSION IN ROTATING COORDINATES 


In the previous section it was shown that the application of singular 
perturbation theory results in a uniformly valid solution to first order of 
for a certain class of trajectories in the restricted three body problem. In 
principle this is a satisfactory solution, but practically there are some 
difficulties because this solution is left in terms of quadratures which must 
be numerically integrated. Furthermore, since the formulation was carried 
out in a non-rotating coordinate system one may ask whether a formulation 
in a different coordinate system would be more advantageous. 

Therefore, in conclusion, the following items are cited as possibly 
leading to improvements or analytical simplifications for this type of first 
order solution: 

1) to obtain analytical approximations for the quadratures which 
depend on some parameter of the zero-order ellipse (in this case the energy); 

2) to represent the problem in a rotating coordinate system as a third 
order system of differential equations by making use of the Jacobi Integral. 

An investigation of the second recommendation has been initiated and 
in what follows the results for the outer solution are outlined in terms of 
quadratures. As a result of this investigation it was found that in addition 
to the choice of a rotating frame of reference the choice of polar coordinates 
was a decided advantage for the following reasons: 

1) The solution for time is obtained from the first order differential 
equation provided by the Jacobi Integral; 
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2) The occurrence of elliptical integrals in the zero-order solution for 
the time is avoided when the radius is used as independent variable; 

3) A solution in polar coordinates readily lends itself to extension to 
three dimensions. 

The details of this analysis follow. 
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In the planar restricted three body problem assume a non- rotating 

earth-centered coordinate system with axes X, Y parallel to some inertial 

axes and let the earth-moon distance equal 1 while the masses of the earth 

2 

and moon are \-ju- and respectively and the gravitational constant k =1. 
The Lagrangian for a massless particle at (x, y) is from Reference 7: 


L = 1 


(x^ + y 2 ) + — - yLA- (x cos t + y sin t) 


(38) 


2 ' r r 2 

In this system the moon rotates with angular velocity U) = 1 and the 
transformation to a rotating coordinate system X*, Y* with the moon at unit 
distance on the X* axis is: 

x* = x cos t + y sin t 

y* = -x sin t + y cos t 


(39) 


where in polar coordinates relative to the rotating coordinate system: 

x* = r cos 0 :,: 

(40) 

y* = r sin 0* 

and r = r*. The Lagrangian in relative polar coordinates 


becomes: 


L = - 

2 


r 2 + (r 0*) 2 


+ r^ 0 ' + 


1 -A 


+ Jh - /i r cos 0 ‘ ( 41 ) 


where: r 2 - 


2 1 1/2 

i 

»— * 
ro 

i 

i 

1 + r - 2r cos 0 = 

L J 

r j 


1/2 


Since L"‘ is time independent there exists an integral of the equations of 


290 


motion known as the Jacobi integral which is equal to the Jacobi Constant C' 
for the relative energy. Thus the expression for the relative velocity becomes: 


r 2 + r 2 (6*) 2 = r 2 + 2 + 2 ^ -2 u. r cos 9* - C' 

r r 2 


(42) 


An asymptotic series solution of the following form is to be obtained: 

,2x 

(43) 


t = t Q (r) + Ji. tj (r) + 0 (,ic ) 

9* = e'o (r) + M ef (r) + 0 (/4. 2 ) 


where yU . 01. The zero order solution is a two body ellipse relative to 
the earth with elements a and b, e, i, c u\ f 'T and constant angular 

momentum to and energy h Q . It will be assumed that the initial conditions 
are taken at the perigee. Then the solution for t is essentially a first order 
approximation to a "Kepler* s Equation” for a special class of lunar trajectories 
in the planar restricted three body problem and t Q is exactly Kepler f s equation 
for the two body problem: 


t = 
o 


a 

Ujt 


- 1 / a - T\ 

cos ( ) - e 

ae 


i- ( ^i) z 


ae 


(44) 


and 0 q is given by: 


0 * 1 '* 

= cos 
o 


-1 


a(l-e 2 ) - r 
re 


Y : 


where 4^ is an initial phase angle between the semi major axis and the X‘‘ 

axis. Such a zero order solution is valid since Lagerstrom and Kevorkian, 

Ref. 4, have shown that within a small neighborhood of the earth of 0 (yuT* ) 

1 “ 1 “ 

the motion is Keplerian up to order /l . Hereafter the subscript zero 
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refers to values for the zero order solution and the subscript i refers to the 
initial conditions. 

The Lagrange equation [l] 0 = 0 provides the following expression 

for the change in the total angular momentum: 


_d_ 

dt 


(r 2 0* + r 2 ) = A 


- r sin 0'*' 

( r 2) 3 


+ r sin 0 


(45) 


Integrating for a first order approximation gives: 



+ r sin 9 




dr 


(46) 


Clearly the integrand in equ. (46) is expressible as a function of r through 
eqs. (44). However due to the transcendental nature of the resulting expression 
for the integrand an analytical integration cannot be obtained directly. Instead 
an approximation for the integral dependent on certain parameters of the zero 
order solution can be determined and exercising choice as to the form of the 

• 'e 

approximation will allow some simplification of the solution for t. Now 9 


becomes: 


e" 

o 


+ 




1 + A- P (r) 


(47) 


where the approximation for the integral has been incorporated in P (r). 

Now the Jacobi Integral, equ. (42) provides a first order differential 
equation for t after substituting for 9 - ': 
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K) Z + 2 = 


(2 i - 2 C) r 2 + 2 (1-/6) r - L\ + 2^r 2 


, r 2(0) 


P(r) (r 2 - / ) - A - r cos 0" 


(48) 


and 


to 


(2 4 - 2 C).r 2 + 2(1 -A) r - 1; 


where = 2 ( $ o - h Q ) is the energy constant for a two body orbit relative 

to a rotating reference frame and A = C* - ^C. Note that in equ. (48) both 

— - and P(r) become unbounded as 0; however, the combination 

dr 

of these terms should remain bounded insuring that — is bounded near the 

s dt 

moon. 


Similarly 0"' ' is obtained from equ. (48): 


+'/*«! 


-A- t' Q - 1 ’ + A -^4- ti + ^ p(r) 


(49) 



where again the prime denotes differentiation with respect to r. 

This completes the outline of the outer solution. A similar investi- 
gation of the inner solution and the results of matching the solutions will be 

* 

final deciding factors in the determination of the practicality of this approach. 
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5. CONCLUSION 


Interpreting the restricted three body problem as a singular perturbation 
problem results in a uniformly valid solution to first order in the small paramete 
yLL for earth-moon trajectories. This solution can be thought of as being com- 
posed of an "outer solution, " valid near the earth and an "inner solution, " valid 
near the moon. The outer and inner solutions are matched in their common 
region of validity by determining certain constants (i. e. the initial phase angle 
of the moon and the elements of the moon-centered hyperbola) in such a way 
that the singularities which appear in the inner and outer solution vanish in 
the construction of the composite solution. The matching constants are ex- 
pressed in terms of the initial conditions, with the exception of a part of 
order /L in the phase angle which can be chosen arbitrarily and can thus be 
used to adjust the lunar perigee distance. 

It has been shown that the outer solution must necessarily contain a 
part that is proportional to the small parameter yU- in order to make the 
match with the inner solution possible. A posteriori this conclusion could 
have been anticipated from a consideration of the order of magnitude of the 
angular momenta of inner and outer solutions. The need for this first order 
correction to the earth- centered outbound ellipse seems to explain why the 
usual patched conic methods (in which such a correction is not made) must 
be inaccurate. But such a statement must be made with some care, since 
in the two methods the matching is performed on a very different basis. In 
the two- variable expansion method the outer solution is evaluated at the 
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moon's distance and equated to the inner solution evaluated far away from 
the moon, but far away in terms of the "blown-up" inner variable. Although 
this procedure makes good sense analytically, it is hard to see what it means 
geometrically. On the other hand, in the patched- conic method the earth 
centered ellipse (an uncorrected outer solution) is evaluated at the sphere 
of influence of the moon and equated to the moon centered hyperbola (the 
inner solution, but in physical variables) at that point. To make a sound 
comparison of the two methods, it should be based on the final numerical 
results, or at least one should determine how much the first order correction 
of the outer solution contributes to the Kepler ellipse up to the moon's sphere 
of influence. 

The composite solutions, in particular the first order correction, is 
left in the form of quadratures for which no analytic expressions has been 
found yet. Therefore, although in theory the singularities of outer and inner so 
solutions cancel, the singularities must be evaluated numerically. This will 
cause numerical problems if the entire trajectory is to be known as a function 
of the time. On the other hand, if it is sufficient to just know the elements 


of the moon centered hyperbola, the quadratures need not be evaluated entirely. 
Only the parts of the first order correction indicated by y ((°) and <f(fy 
are required, and in particular their difference u- r) . These functions 
depend only on the total energy — and can be evaluated once and for 


all for any interesting range of energies. There is an additional difficulty 
since cT and tend to infinity for minimum energy trajectories, but even 
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there the difference u- r) remains finite. 

These difficulties may limit somewhat the practicality of the methods 
depending on how much trouble one would want to go through to write a com- 
puter program that evaluates the quadratures. Even so the method is of 
great interest and a similar development may be attempted along some 
different approach. Such a different approach is given in Section 4. 
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DEFINITION OF SYMBOLS 


t 

f 

R 

r 

G 

P 

Q 

J. 

2 

k 

e 

g 

P 

q 

a 

a 

n 

K 2 

A 


Time 

True anomaly 

Position vector 

|R I = magnitude of R 

Gravitational constant 

Angular momentum vector 

Eccentricity vector 

GxP 

Unit vector in direction of x axis 
Unit vector in direction of y axis 
Unit vector in direction of z axis 
Eccentricity 

!— ! 

1*1 

IQ I 

Time of perigee passage 
Semimajor axis 
Mean motion 

Coefficient of second harmonic of the potential due to the oblateness 
of the earth 

3/^ 

4 

g 
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(r,0) Polar coordinate system introduced in x-y plane 

SUBSCRIPTS 

1,2,3 1st, 2nd, 3rd component of a vector 
o Initial value 

s Short periodic 

Z Long periodic 

SUPERSCRIPTS 

• Differentiation with respect to time 

Differentiation with respect to true anomaly 
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APPLICATION OF VARIATION OF PARAMETERS 
TO THE POLAR OBLATENESS PROBLEM 

By 

John Morrison 
Henry Weinberg 


SUMMARY 


This report presents the derivation of a set of two body parameters and 
their associated perturbation equations. These equations are applied to the 
polar oblateness problem characterized by the second spherical harmonic. A 
modified Poisson method is used to obtain the first order solution to the problem. 
The modification of the method is introduced in order to eliminate the occurrence 
of secular terms which, because of the parameters employed, would have caused 
a rapid deterioration of the solution. The approximate solution is expressed as 
a function of true anomaly. Some analysis of second order theory is presented 
which suggests that difficulties with particular initial conditions may be avoided. 


INTRODUCTION 

Among the numerous troublesome aspects which one encounters in attempt- 
ing to integrate the perturbation equations for the polar oblateness problem, two 
difficulties may occur which appear to be subject to, at least some amelioration. 
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In general, there are two decisions one must make before these difficulties be- 
come apparent. These decisions consist of selecting a set of parameters and a 
method of integrating the perturbation equations. The possible sets of two-body 
parameters may be divided into two groups, one of which contains canonical 
parameters and one which does not. Two methods of integration, in general 
use, are Poisson’s method (1) and Von Zeipel's method (2). The latter method 
is applied only to canonical parameters. In most instances, regardless of the 
set of two-body parameters or method of integration employed, the results 
present two interesting properties. The first is the occurrence of terms in the 
approximate solution which show a secular growth. The second is the presence 
of singularities in the second order corrections for certain initial conditions of 
the parameters. The first property is not, in general, objectionable since the 
secular terms usually appear in the expressions for angle parameters. How- 
ever, for some parameters, such as the unit perigee vector, the occurrence of 
secular terms destroys the unit characteristic and limits the applicability of 
the results to relatively short time intervals. 

It is proposed in this report to derive a set of parameters and their as- 
sociated perturbation equations which, when applied to the polar oblateness 
problem, yield, after approximate integration, equations for the parameters 
which manifest no secular growth to the first order, except for one element. 

A brief analysis of the structure of the second order perturbation equations is 
developed which suggests that the occurrence of singularities arising from 
initial conditions is not a necessary concomitant of the polar oblateness problem. 
The application of second order theory , however , will not be attempted in this 
report, because the parameters which have been chosen degenerate for nearly 
circular orbits. Even though the set of parameters employed is defective, the 
comparative simplicity of the perturbation equations recommends the use of 
these parameters for a clearer insight into the particular difficulties which their 
use is intended to eliminate. It should be noted that the degeneracy of the para- 
meters for nearly circular orbits is not a case of replacing one difficulty with 
another, but is simply a consequence of the choice of parameters and not of the 
integration technique. A more judicious choice of parameters has been made 
and an improved integration technique developed which eliminates the imper- 
fections in the present method. A report is now in preparation which incor- 
porates these developments. 
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DERIVATION OF A SET OF PARAMETERS FOR THE KEPLER PROBLEM 
To specify the solution of the vector equation 


R + 



= 0 


( 1 ) 


six independent parameters are needed. For the purposes of this report, the 
following set will be used : 

a , the time of perigee passage; 

P, the eccentricity vector; 

Q, a vector perpendicular to P and lying in the plane of motion. 

At first glance it would appear that this set contains seven independent 
elements; but, since P^ and Q are mutually orthogonal, any one component may 
be expressed as a function of the remaining five. The vectors P and Q may be 
obtained from Eq. (1) in the following manner: Take the cross product of R and 
Eq. (1) 

• • 

R x R = 0 (2) 

Integration of Eq. (2) gives 

R x R = G (3) 

in which G is the constant angular momentum vector. Now take the cross 
product of Eq. (1) and G 

•• uR 

RxG+-^~ xG=0 (4) 

r rf 

After expanding RxG using Eq. (3) and recalling that G is constant, Eq. (4) 
integrates to 


RxG- 



( 5 ) 
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in which Pisa constant vector. To find the magnitude and direction of P 
rewrite Eq. (5) in the form 


P = r(r • R -^)-R (R* R) ( 6 > 

Evaluating Eq. (6) at perigee yields 

P=U pie ( 7 > 

P 

where 

e is the eccentricity of the orbit 

and U is a unit vector in the direction of perigee. Let Q be defined by 
“P 

g=GxP^RxG + Rg 2 (8) 

The magnitudes of G, P,and ^ are g, p = pie, and q = gp, respectively. 

Since R, P.and Q are coplanar, R may be expressed as a linear combina- 
tion of P and Q 

R = a 1 P+a 2 Q ( 9 ) 

The scalar product of Eq. (9) with P yields 


R ’ R r cos f 



where f is the true anomaly of R. Similarly , 


R ’ 9 . r sin f 



R may be written as 

Making use of the well known formulas 


( 10 ) 


( 11 ) 


( 12 ) 


r = 


r 


pi (1 + e cos f) 


(13) 
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( 14 ) 



it follows that 


a, = Msinf 
1 gP 


_ e + cos f 

2 gq 




(15) 

(16) 


PERTURBATION EQUATIONS 

After having obtained a set of parameters the first step in deriv ing the 
perturbation equations is to introduce the perturbing force F on the R. H. S. of 
Eq. (1) which gives 

•• uR 

R + ^g- = F (17) 

The perturbing force F will cause R to deviate from the Keplerian orbit, and a 
new solution must be found. This solution can also be put in the form of 
Eq. (9), but now the parameters G, Pand q will be functions of time. In order 
to determine the time dependences, it will be necessary to obtain the differ- 
ential equations for the parameters in so far as they depend on the perturbing 
force F. 


Differentiation of Eq. (3) gives 
G = R x R 

Substitution of Eq. (17) yields 
G = Rx F 

'Similarly, differentiation of Eq. (5) gives 

• *• • R x G 

P = RxG+RxG + pi ~~ ~ 


(18) 


(19) 


( 20 ) 
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Substituting for G and R yields 
• • 

P = F x G + R x (R x F) 

From Eqs. (8), (19) and (21), Q is given by 

£ = ^Rx(RxF)+Fg 2 + 2R(G* G) 


(21) 


( 22 ) 


The equation for the variation of cr , the time of perigee passage, is deriv- 
ed from Kepler's equation, which, for 0 < e < 1, takes the form 


, , -1 . ,V1 - e 2 . , eVl - e 2 

n (t - cr) = tan sin f e+ -- f -smf YTV^f 


(23) 


where n = 7 — % and g -J+ pa (1-e ). 


+a 


For e > 1, Kepler's equation is given by 


, -1 . e 2 - 1 ■ a eV e“ - 1 

n (t - a ) = tanh sin f . ^ - sin f 


e + cos f 


1 + e cos f 


(23') 


-a 


where n — and g = ^ sl(o -1). Using various identities, Eqs. (23) may 


be put in the following form 

-l R * R 


R • R 


n (t - cr ) = tan 


(1 - — ) a 2 n a 2 n 
a 


n (t - cr) = tanh 


^ R • R 


R • R 


(1 - — ) a 2 n a 2 n 
v a 7 


(24) 


(24’) 


Differentiation of these equations with respect to time, and substitution of 

• • 

Eq. (17) for R gives, in either case 


cr = F • {-^7 R(t-cr) + ^R+^ 
P Up 


2 


(1-e 2 ) (R • R}R-|P j} (25) 
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where 


• • 


and 


1 R • R _ 2_ 

a n r 


*=£• i O 2 !-) 


It is convenient to have available the total time derivative of true anomaly. 
Differentiating the expression 


R P 

cos f = — • — 
r p 


it follows that 


(26) 


- (sin f) f = 


r R _ R r P R /' P 


p r m 

p + r 'vp ) 


(27) 


and therefore 


2 p q 


(28) 


APPLICATION OF THE PERTURBATION EQUATIONS 
TO THE POLAR OBLATENESS PROBLEM 


In this report, the polar oblateness problem wiU be assumed to be char- 
acterized by the perturbing potential 


$ = 





(29) 


In order to apply the perturbation equations, previously presented, to this 
problem, it is necessary to specify the perturbing force F. This force is the 
gradient of the perturbing potential $. 
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F = - 


(30) 


' 


{ 1 - 5 — 2 R+ 2zk} 


The procedure for applying the perturbation equations may be outlined as follows: 

(a) Reexpress the perturbation equations in terms of the parameters 
P, Q, and G, and true anomaly, f, by substituting Eqs. (9), (12), 
and“(30) for R, R, and F, respectively. 

(b) Since the resulting, equations are functions of true anomaly, it is 
legitimate to take f = g/r 2 , for a first order approximation. It 
follows that the differential equations with respect to time may be 
transformed to differential equations with respect to true anomaly. 

(c) These perturbation equations are now written as Fourier polynomials. 
Terms with constant coefficients are transposed to the L. H. S. 

(d) To obtain a first order solution for the system of equations derived 
in (c), all parameters on the R. H. S. and the parameter g, wherever 
it occurs, are held constant. Under these conditions, the system 
can be solved exactly. 

(e) The perturbation equation for the parameter a is treated similarly 
with some modifications. 


Carrying out the operations indicated in (a) , (b), and (c) the results are: 


3jU K 9 

g 


r. Q 3 P P 3 Q 3 Q r 3 p 3 5 Q 3 ~ 

t-q ppq - qL 2 n 2 2 q 2 J 


} 


g 


+ e 


(31) 


= 3 - pL {k[^- (jL. sin 3 f + e sin 2 f + ~ sin f ) - cos 3 f 

cos 2 f - ^ cos 0 J + f lCt - ) (%sin5f + f sin4f 
+ ( l +i fr ) Sin3f+ 3e sin 2 f + (\+ §|- ) sin f) - cos 5 f 

+ 3e cos 4 f + (^ + ^ e 2 ) cos 3 f + 4e cos 2 f + ( | + ^ e 2 ) cos f^) 
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^ Q 3^V5e 2 . c , ,3e . 4 . ,7 lle 2 ^ . 

- ^ — J ^ -jg- sin 5 f + 2“ sin 4 f + (- + -jg— ) sin 3 f + e sin 2 f 


+ — |) sin f sin 3 f + e sin 2 f + (1+ —) sin f)J 


+ “ -1) cos 5 f + cos 4 f +(^ + ^ e^) cos 3 f + 3e cos 2 f 


+ (f ♦ % e 2 ) cos I ) + (^) 2 ( £ cos 5 f ♦ * cos 4 f ♦ ( \ + “ e 2 ) cos 3f 

7 9e^ 'N ^3^3 S' 50^ 7 o 

+ e cos 2 f - (^ + -g— cos ij - — Q sin 5f + 3e sin 4f+(| +3e ) sin3f 


2 2 2 

+ 4e sin 2f+(|- - sin f) + cos 3 f + ecos2f + (1+ cos f}]} 


(31) cont'd 


Q -.^{. k !3 + Pr5!/ + 3^_ l1 + 2^S } 

3 ^ “ P P L 2 2 2 2 jqpqJ 


g 

3 M 3 K 2 

g 3 


f r P 3 /'g 2 3 e 2 n Qo ^2 

|-k[_ — (^2" cos 3f+e cos 2f + -g- cosy + Q-g- sin 3 f 


+ e sin 2 f + sinf)]+— C%" cos5f + fr cos4f 

+ (|+ || e 2 ) cos 3f + 4e cos 2f + (|+ ^ e 2 ) cos f) ~(^“) (^^fg cos5f 


+ cos 4f+ (| + cos 3f - (I + | e 2 ) cos ()+ ^2. (s£ 

% ° y p q V 2 


sin 5 f 
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2 i ^ .2 

+ 3e sin 4 f + (| + 3e 2 ) sin 3f + 4e sin 2f + (|- - g) sin f) - Q-j- cos 3 f 


+ e cos 2 f + (1 + ^f-) cos f) ] +-| [( -~) C% sin 5f + ^ sin 4 f 


2 Q 2 2 

+ ( 1 + jg e 2 ) sin 3f + 5e sin 2 f + (\ + ^~) sin f) - sin 5 f 


2 ^ . 
+ ^ sin 4f + ( \ + ^j|- ) sin 3 f + e sin 2 f + (^|- - |) sin f) 


‘O-L f 5e cos 5 f + 3e CO s 4 f + (1 + ^ e 2 ) cos 3f + 6e cos 2 f 

p q V 8 z o 


2 2 2 
+ (i + ^|-) cos f) - (jg sin 3f+e sin 2f + (1 + 5|_ 


) sin f)_ } 


(31) cont'd 


G , + !L!i{l!_3 + 9?3} xk 

- 3 l p p q q J ~ 

D 


2 I 

J 


J P _3 Q cos 3 f + cos 2 f + ^ cos f) + ~ (j sin 3f + sin 2 f 


e . 

+ 2 SU1 


Q, 


f)] + sin 3 f + sin 2f + | sin f)+ |cos3f-cos2 f 


e c 
+ g cos f 

where 


x k 


( 



.) 
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Consider the system of homogeneous equations obtained by setting the 
R. H. S. of Eqs. (31) equal to zero. 


- /„ %, 2, P * r 3 V . 5 Su 


P - 
“1 „ 4 

g J5 


( k ^3i -f£f3 

u q* p * p * q * «* L2 


2 + 2 


-]H 


S i " _ 4 




g 


tip,VU! *!j-TTrrH (32) 


i 


l Z 


G , + A p 3i . 9 , « 


-i 4 
6 i 


|£i 1J* + fl > 

l P, P„ q £ 


r Z *SL 


x k= 0 


It will become apparent that P , Q , and G represent the long periodic 

jC Xj jC 

terms of P, Q, and G, respectively. 

For this system of equations, Eq. (8), Q =G x P still holds. Since 

X> H Xj 


p = P • P 

y l ~Z -l 


q = Q * Q 
4 j l s je ~Z 


g = G • G 

e z -z -z 


It follows from Eqs. (32) that 
p' =Zl . p' =o 

* p, 


Q 

q' = _1 . q' =o 


(33) 


G 

g' = — ■ G' =0 

* z g £ - z 


Therefore, for this system of equations, p , q , and g are constant. 

Xj Xj Xj 


Similarly, 


p„ v p, y q„ v q* y 


(34) 
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so that 



(35) 


is constant. 

Using the identity 


kx <k = zl Q -2L 


'" g i p i q i 


% p « 


it follows that 


( 


P,P, 9, \ J - S h p l h - 


G 




U 

Z i L V SL % 


p ;j=if^ g 


i 


Therefore, Eqs. (32), can be rewritten as 




Qc 





3q 2 K2 

where A j— 


The third components of P ^ , Q'^, and G' ^ are 


(36) 


(37) 


(38) 
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( 39 ) 


p V Ap ^(f B - 2 ) =0 

q , 3 i + ■^■( 2 -| B ) =0 

G V° 

which form a system of first order, linear, homogeneous differential equations 
with constant coefficients. The solution is 


P 3e - p 30 008 { A Cl B - V f } - So sil1 { A Ci B - 2 ) f } 

Si “ P 3o 3in ( A ( 1 1 8 - V 1 } + So cos { A (I B ■ 2 ) f } < 40 > 


°3£ G 3o 


where Pg o , Qg Q , and Gg Q are initial conditions. Similarly, the first two com- 
ponents of G ' are 


G V AG 2 ,% £ = # 

l 


G '2, +AG 1^ = ° 


(41) 


This system has the solution 



where G^ q and G^ q are initial conditions. Using the identities, 
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(43) 


G lX °3 i_ 


g * g * 


P * 


q i q X 


G 2£ G U _ P 2j P 3£ Q 2l 


h h 


P * P X 


q * ^ 


Eqs. (41) may be transformed into 


o' + A g {^^ + ^^4=0 
G l« +Ag ^p,P i * t % > 


G ' _ Ag 

U 21 Ag Z l V t % % J 


(44) 


Eqs. (44) together with the identities 


G U _ V 2i %i 5§i 


e t 


h % 


h q i 




g, 


p * q * 


9 i ^ "e 

determine the remaining components of P and Q which are 


(45) 


P n ~ AB 1 p. v h J h %' 


*2 o AB l e a. p. ^ R„ J 


2l AB 


g„ q 


S/ 


o A( a 3JI 

Q U “ ab l A g £ p £ 


Q 


3^ 


G 


2 A 


gi 


(46) 


Q 2^AB I" g^ p £ q A ^ g, 'J 


Q t 


G, 
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All quantities appearing on the R.H.S. of Eqs. (46) are known. After 
some algebraic manipulation, the solution for the system of Eqs. (32) maybe 
expressed as 



To find the particular solution of Eqs. (31), assume a solution of the form 


(47) where P Q , Q q , and G q are functions of f. Substituting solution (47) into 

the L. H. S. of Eqs. (31) will yield three equations for P ', Q and G '. 

— O “O — o 

After solving for these derivatives, and recalling condition (d), P , Q , and 
G q may then be found by integration alone. If the second order terms in this 
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solution are neglected, the results are equivalent to integrating the R. H. S. of 
Eqs. (31) and adding the results to solution (47). The first order solution for 

P, Q, G, is 


p = p e +[? s 


§ =% + L§s! 


(48) 


where P , Q , G g , are the integrals of the R. H. S. of Eqs. (31), and the 
quantities in brackets are to be evaluated between the limits f and f^. 

In the perturbation equation for ex, Eq. (25) it may be noted that 


d$ 

R • F = - 3 $ R * F = -ft 

If the parameters a and <y are held constant at their initial values 
j r 3a -s c 3a . a n— T 


(49) 


Therefore, Eq. (25) may be rewritten in the form 


d_ 

dt 



2 r 


e 2 )(R 


R) 


R - - p) • F 
— a — J — 


(50) 


Differentiation with respect to time is transformed to differentiation with 
respect to true anomaly, and the R.H.S. is expressed as a Fourier polynomial. 
The result is 
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d 

dt 


r 3a 0 i f /'^ > o\2 r 

|a + — 4>( t -a ) \ = -—£- -(— ) 

L M ° J „3„ L v P y L 


P - 2 -5e 2 


g 3 p 


p ✓ L 16 


cos 5 f 


3e 

2 


+ cos 4 f + 


C 5e 2 7 n 

\ _ _ 3e „ / 

^5 5e 


V 16 + 4> 

) cos 3 f + ~ cos 2 f + ( 

^4 " 1 

r >osfJ 


+ C^) [%-cos5f + ^cos4f + (|-^e 2 )cos3f 

“T cos 2 f + Ct -£cosf]-^[^ Sin 5 f- | sin 4 f (51) 
+ ^V“l) COsf ] "1? Sin5f-|sin4f+(l -^)sin3f 


2 2 

+ 3e sin 2 f - Q - + |) sin f] + [-• cos 3 f + e cos 2 f 

+ C 1 " t) cos f ]} 

Holding the parameters on the R.H.S. constant, Eq. (51) is integrated to 


yield 


3a $ 


L wa t - 

a (t - a ) 

s n v o' J, 


(52) 


where a is the integral of the second member of Eq. (51). 


CONCLUSION 

The solution (47) obtained has f appearing in arguments of sines and 
cosines, these terms having two essentially different periods: 2n/j (short 
period where j is a natural number), and 2ir/A (long period where A is a 
small quantity and equals S^Kg/g 4 ). The solution is well behaved for all 
values of f because f appears in arguments of sines and cosines and because 
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these functions are found only in the numerator. This would not be the case if 
Eqs. (31) were integrated keeping all parameters constant; for then, the long 
periodic terms in the previous solution would be replaced by their first order 
approximations. This solution would grow linearly with time. 

The next step in the usual procedure for deriving the second order 
approximation consists in substituting the first order solution for the parameters 
- in Eqs. (31). Before this step can be carried out, however, it should be recall- 
ed that -Eqs. (31) were obtained by putting dt/df = r 2 /g. If higher order solutions 
are to be found, this approximation is no longer valid. Therefore, for a second 
order approximation, dt/df must be replaced by its first order approximation 
derived from Eq. (28). 

Now suppose the parameters are replaced by their first order solutions, 
terms of order K 0 3 are neglected, and products of trigonometric functions are 

Li 

replaced by trigonometric functions of sums. Under the following conditions, 
the resulting equations may be integrated to give a well behaved second order 
solution : 

(a) No constant terms are present 

(b) Whenever cos af or sin o?f occurs (a a small quantity), or must 
also appear as a factor in the numerator. 

If these conditions are not fulfilled, and the equations are integrated, 
f may occur outside trigonometric functions, or small divisors may be present. 
A possible solution to these difficulties is obtained as follows : 

(a) Denote the short periodic terms of the first order solution of 

p, Q. G by P s (P 0 , Q 0 , f), 9 S <P 0 , § 0 , 0, G s (P 0 . 9 0 . i) ”"3 

assume a solution of the form P = P^ + P g (P^» Q^» f)> Q = 

+ 9 S (P«. a t . t>, G - G t + G s (P r g t . f) P t . 9 t . «, are new 
variables which, to first order, are equivalent to solution (47). 

(b) Substitute these expressions into both sides of Eqs. (31) as 
modified in accordance with the qualification regarding dt/df 
mentioned above. Neglect terms of order Kg 3 ; expand into 
Fourier polynomials, and neglect terms multiplied by sines 
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and cosines. P^, Q^, are determined from the resulting equations. 

Investigations are currently being pursued for the purpose of finding the 
second order solution by this method. 


APPENDIX 

EXAMPLE OF RAPIDLY VARYING PARAMETERS 


Whenever perturbation equations for a set of parameters are solved employ- 
ing an approximate integration method, it is always desirable that the parameters 
be slowly varying. It is likely that, for the polar oblateness problem, no set of 
parameters exist in which all elements possess this characteristic. An example 
is presented to demonstrate the existence of rapidly varying parameters for the 
polar oblateness problem. Consider the equation 


z + 





z + 2 z 


] 


which is obtained by taking the scalar product of Eq. (30) with k . Given the 

• “ 

initial conditions z (t Q ) = z (t Q ) = 0 , it follows that all derivatives of z evaluated 
at t - t are zero. Therefore z is identically zero. 

In the following example it is to be assumed that this is the case. Then 
G = RxF = 0 orG = Ggk where G^ is a constant. Now introduce a polar 
coordinate system, (r, 0 ) in the x-y plane. From Eq. (30) two scalar equations 
result: 


•; _ , 0*2 _ y _ 3 ^ K 2 

r - r(0) — 

r r 


1 d , 2 „ 

T 5F (r e ) = o 


A particular solution of these equations is given by 
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la 3uK 2 
r = V e=9 o ±1: V“% + “5" 

r o r o 


where r . 0 are constant. Since 
o o 


- U 2 9 ! =h» 


3 uK„ 


and 

2 

e cos f ' 1 
it follows that 

3K 2 

e cos f = — tt 


Also, r = 0, so that 


. . /Lir esinf 

R R = rr= — = 0 

As a result it is seen that e sin f = 0. Therefore, it may be concluded 
that e > 0, f = 0. From the equation 

r p qa 

R = r (^cos f — + sin f — J 

one obtains 

P 

It is clear that the vector P is always in the direction of the vector R and 
is thus a rapidly varying parameter. Consequently, there is no guarantee that 
the method of variation of parameters and an approximate integration procedure 
will yield a satisfactory solution. 
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SUMMARY 


This report contains the development of a first order solution for the 
polar oblateness problem with the potential limited to the second spherical 
harmonic. The development begins with the equations of motion of the two- 
body problem. Expressions for a set of parameters are derived. The per- 
turbation equations of these parameters for an arbitrary disturbing force are 
generated, applied to the oblateness force and integrated analytically to obtain 
the first order solution. This solution is valid for all orbits except those 
which are nearly rectilinear. 
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I - INTRODUCTION 


The purpose of this report is to present the development of an improved, 
approximate, closed form solution to the equations of motion of a vehicle about 
a spheroidal earth. The nonsphericity of the earth is assumed to be character- 
ized by the second spherical harmonic. A feature common to some solutions 
which have been offered is a limitation on the applicability of the solutions in 

1 2 

the neighborhood of an inclination of 63°, due to a singularity at this inclination ’ . 
The original motive for the investigation, the results of which are presented 
here, was to examine the possibility of overcoming that restriction. Since the 
use of the argument of perigee is the immediate occasion for the presence of 
the critical angle of inclination, an obvious corrective measure is the choice 
of a set of parameters which does not incorporate that element. However , 
numerous other pitfalls must be avoided. Some of these are: a) indetermination 
of the initial value of the time of perigee passage for nearly circular orbits , 
b) degeneracy of the solution caused by the presence of the eccentricity in the 
denominator of the perturbation equations for nearly circular orbits , and c) 
the introduction of secular terms in elements which are clearly bounded as a 
result of the integration of the perturbation equations. The particular set of 
two-body parameters selected for the present development has been chosen 
so as to minimize the difficulties listed above. Neither time of perigee 
passage nor argument of perigee is included in the set of elements, none of the 
perturbation equations contain the eccentricity in the denominator and the in- 
tegration process is modified so that secular terms do not occur explicitly in 
the equations for bounded elements. However, it should be noted that the solu- 
tion is not applicable to nearly rectilinear motion. 
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The development is self-contained. First, expressions for the two body 
parameters are derived from the equations of motion, then perturbation equations 
for these parameters are obtained for an arbitrary disturbing force, and are then 
particularized to the oblateness problem. Next, these equations are integrated to 
obtain first order corrections. Finally, some remarks are included concerning 
the properties of the parameters, some general results of the second order theory, 
and some possible applications. 

In this report, the convention is adopted that capital Latin letters repre- 
sent vectors (or matrices), and small Latin letters with appropriate subscripts 
indicate the components of these vectors. 

n - DERIVATION OF A SET OF TWO-BODY PARAMETERS 

The equations of motion for a vehicle of negligible mass about a spherical 
earth are: 


R 


+ -^-R= 0 
r d 


( 1 ) 


where is the product of the gravitational constant and the mass of the earth, 

R is the position vector and r is the magnitude of R. A rectangular, inertial 
coordinate system is used with the equatorial plane taken as the x-y plane. 

The general solution to these second order, differential equations generates 
the vectors R and R as vector functions of six constants of integration and time. 
The six constants are determined by a complete set of initial conditions: vectors 


From the many constants that can be derived, an independent set must be 
selected. For application to the oblateness problem, the following set has been 
chosen: U, V, g, e cos 0, e sin 0 and t^. U and V are unit orthogonal vectors 
which specify the plane of the motion. The parameter, g, is the magnitude of 
the angular momentum vector, e is the eccentricity, and 0 is the angle measured 
from U to the perigee vector. The parameters g and e determine the shape and 
size of the osculating ellipse and 0 gives the orientation of this ellipse in the 
plane. Expressions for these parameters will now be derived and their in- 
dependence demonstrated. 
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Three of the constants are obtained immediately by crossing Eq. (1) on 
the left by R and integrating the result. 

R x R = 0 ( 2 ) 

• • 

R x R = G = R Q x R Q ( 3 ) 

The magnitude of this constant vector just defined is one parameter, g. 

The other two parameters are contained in the unit vector G/g which may 
be expressed as the cross product of two orthogonal unit vectors in the plane 
perpendicular to G. Thus 

— = U x V ( 4 ) 

g 

U is arbitrarily chosen to be in the direction of R^; this direction is not 
a constant of integration and therefore not a parameter. Thus R and R can be 
expressed as follows: 


R = (R • U) U + (R * V) V 

(5) 

R = (R • U) U + (R * V) V 

(6) 


Let<p denote the angle between R and U. Then R • U - r cos c p and R * V 
= r sin<p. 

• * 

To obtain an expression in <p for R • U and R • V, one proceeds as follows: 

£. u = -lUiLUZ) = 

r r 

= My . I r ( R * R) _ R 
g L r 3 r J 


Integration yields 
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( 7 ) 


R • U = -^V* — + c, 
g r 1 

Using the initial conditions to evaluate the constant, one obtains, recal lin g that 
initially U is in the direction of R Q , 


R "’ R “ M . V R 0 

+ * sm iD n = c, = - — - — — 


0 “0 . n 

f S ^0 = c l r, 


0 


0 


since = 0. Eq. (7) now becomes 

r. u = -iifH_LV g V R C 
gL r M r Q 

In the same way, starting from 


( 8 ) 


R * V = 


_ M u . I r» (R * R) R 
S L K 3 " r 


one gets 


R. y-ii (?. : m = 


g r 


(9) 


Using 


R.(G xU ) = G I _G > 
' ^ g gr ’ 


and the initial conditions, the constant c 2 may be evaluated 

_ ^ 2 N 

-SL_M= C =U(&-_ 1 ) 

r 0 g 2 SV 0 V 

Thus, finally, Eq. (9) becomes 


R- V = *[ + iV. 

g - r ^M r o v -I 


(10) 
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It is still necessary to express r in terms of (p in the new parameters. This 
will also allow one to express R as a function of these parameters alone. 
First, one multiplies Eq. (8) by sin<p and Eq. (10) by cos <p and subtracts to 

obtain 


R- (U sin cp - V cos cp) = ' g{f * ( Ucos <P +V sin<p) 

r R„ • R a ,-2 -v 

■ L sin * - ^F 0 - 0 cos f]} 


But 


R • (U sin <p - V cos <p) = R * Q ^ x ^ 


and 


— * (U cos <p + V sin <p) = 1 


Hence 


_ K = _ U{i _ I £ -5 sin cp - - l) cos <p I } 

r g L Lu r A v V ur n ✓ v J J 


g ' Lm r Q 


M r 0 


and 


r = 


_g_ 


r, rsL ^ gRo ,R o , i 

^l+costp^j^ sin<p J 

One now defines the parameters e sin 0 and e cos 0 by the relations 


(ID 


and 


1 = e cos 0 

4 r 0 

Ra' R n 

- £• '= e sin 0 

M r 0 
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where it is evident that e is the eccentricity and 0 is the angle measured from 
U to perigee. Eq. (11) thus becomes 


r = 


g 2 g 2 

M (l + eco S «p-e)) “Mi + eoosf) 


where f is the true anomaly. 


One can now rewrite Eqs. (8) and (10) as follows: 


<U') 


R • U = - j* (sin <p + e sin 0) (8') 

o 

R * V = ^ (cos <p + e cos 0) (10 ') 

A further expression is required to relate <p andt. In the process of deriv- 
ing this relation, a sixth constant of integration will arise. To do this one pro- 
ceeds by multiplying Eq. (8) by cos <p and Eq. (10) by sin<p and adding: 


R * (U cos <p + V sin <p) = - g (V cos cp - U sin <p) - j^ 

2 


V R 0 


COS <p 




The left hand side is R • R and the first term on the right is zero. Hence, 

r 

= °r n ~° COS< P + (^ -l)sin<pj=£esinf = i- 


0 


(12) 


From Eq. (11) 


t = 




r f g 2 g R 0 ’ R 0 

C os«p 


R„ • R„ 

o 


i /v 2 e - Ro * Ro 

p [1+ cos <p - 1) — Sin <p 


2 P O- 2 

I 2 g 


= it_ r 2fRJ_R-j £ . 
L r J jLi 


<P 
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or 




Hence 


r 


(13) 


This equation may be written as 




and using Eq. (11 7 ), one obtains 


& 

= * (1 + e cos f) 

u [l + ecosfJ 0 g v 


or 


2 


52 = U — 1 - e 2 + e cos f + e 2 cos 2 f + e 2 sin 2 f 

1 + e cos f 3 L 
g 


(14) 


Considering only the factor on the right, the following statements can be made. 


( 1 ) 1 L_ (1-e 2 ) is a constant and can be shown to be equal to , ^ 

g 3 V 1 “ e 


n 


where n is the mean motion 
2 


( 2 ) 


l4 eco8f(1+eco3f ) = |^^=i <^|-e si nf 
^ g 


(3) | e 2 sin 2 f = | j* e sin f r 


Combining the last two expressions above, one gets 

^2 [r ^ (e sin f) + e sin f t] = ^ (r e sin f) 


g 


g 
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Thus Eq. (14) now becomes 


<2 

1 + e cos f 



_y.iL 

2 dt 
S 


(r e sin f ) 


Integrating with respect to t, the left hand side becomes after some algebraic 
reduction. 


If 


dt 

+ e cos f 


J 1 - 


tan 


_1 s’ '/ 1-e sin< P 

v (1 + cos <p) (1 + e cos f) + e sin f sin 


in (p) 


and the final equation* is 



9 

— 2 V 1 “ e ( r e sin f + r Q e sin 6) (15)* 

s 

This is also the defining equation for t Q , the sixth constant of integration. 

In Eqs. (5), (6) and (15) the constants U, V, g, e sin 6, e cos 0 and t Q 
occur. To summarize, these constants are defined by the following equations: 


tt R f G R 1 

U=— cos<p-sm<pj_jx- j (16) 

V = ^sin<p+ cos<pj_^x (17) 

S = I G | (18) 

e cos 0 = ^ (R • V) - cos <p (19) 


* This equation holds only for e < 1. Only slight modifications are required 
required for e ^ 1. 
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( 20 ) 


e sin 0 = j* (sin <p) - R * U 


t = t - - tan 


2. -l^V 1 - e2 sin<P 


(il. 

Ml + 


n 1<U v(l + cos (p) (1 + e cos f) 


=-7—-) 

+ e sin f sm <p ' 


( 21 ) 


+ — ^ Jl - e^ (r e sin f + Tq e sin 8) 
ng 


It should be noted that U and V account for only two independent parameters 
since they are orthogonal unit vectors and the direction of U was chosen arbitrarily. 
It remains to be shown that the parameters just defined are independent of each 
other. This will be proved by showing the equivalence between the set above and 
the set Rq, R q whose elements are known to be independent of each other. 

Further discussion of these parameters appears in Section VI. 

From the derivation that has preceded, one easily obtains Rq and Rq in terms 
of the parameters on the one hand, namely, 


2 

_ 

r 0 - fx (1 + e cos 0) 

g 2 

R 0 ~ fj, (1 + e cos 0) U 

R. = C & - e sin 0^) U + (1 + e cos 0) V 
0 vg y 

and on the other hand, the following parameters in terms of Rq and Rq-. 


g=J(R 0 xR 0 )- (Rq X #t 0 ) 

V = f X ^ < G = R 0 X V 
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e 


2 

g 


cos 0 = 


M r o 


1 


e sin 


e = -s^ 

H 1 


R, 



ID - PERTURBATION EQUATIONS 

Before proceeding to the development of the perturbation equations, it 
should be observed that, of the quantities listed at the end of the preceding 
section, only six have been obtained as constants of integration. These are 
t^, e sin 0, e cos 0, g, and two contained in U and V which determine the 
plane of U and V. The third constant contained in U and V which specifies the 
direction of U in the plane is arbitrary. This last arbitrary constant does not 
vary under the action of the disturbing force. As a result, U is not subject to 
rotation about the angular momentum vector.* Since <p is measured from U, this 
restriction implies that cp does not include the time rate of change of U in the 
osculating plane. As a consequence, the time rate of change of <p must have the 
same functional form that it has in a purely Keplerian motion, i. e. , <p = g/r 2 . 
Keeping in mind the result just noted, the method for obtaining the perturbation 
equations for the set of parameters is as follows: 

• Each of the Eqs. (16-21) is differentiated with respect to time 

(considering the parameters too as functions of time) 

•• 

• Wherever R occurs, it is replaced by 

- + F (22) 

r 

where F is the disturbing force 

• The resulting equations are simplified by making use of the 
relations obtained in the preceding section. 


* Compare Ref. (4). 
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1. Equation for g 


Noting that 

g 2 = G • G 



G 


G = R x R 


one obtains 


G=RxR=Rx 


(r*\ 



x F 


and hence 


fe=(§ xR )' F 

2. Equation for U 

Differentiating Eq. (16) with respect to time 

U = (- sin <p R - cos <p ^ x R )<p - R ^ 2 ~ u 

+ coscp^- sin<p(|x R )- sin<p[(i)x R 


Then, using 


and 


r = M j^_ ( S i n <p + e sin 0) U + (cos <p + e cos 0) V 


— x — = - -W- 1 (sin (D + e sin 0) V + (cos <n + e cos 
g r gr L ^ 


it follows that 


R 

cos <p — - srn <p 



sin f U + (1 + e cos 



But 

^•esinf = ^-£ (Eq. (12)) 

r 

and 

£ (1 + e cos f) = (Eq. (11 7 )) 

s r 

or 

j£( l + ecosf)=<p 

The coefficient of <p in the U equation above is then simply - V. Replacing these 
terms in the U equation 


U = - V<p - U + ^ 


U+ V<p- sin<p[^Qpx^] 


but 


and hence 


or 




U = - sin <p-(- • f)- 
v g v g >' g 


3* Equation for V 


hi an entirely analogous fashion one obtains V, 


(25) 
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( 26 ) 


V = cos„f(§-F)f 

4. Equation for (e c6s 9) 

(e cos 0) = **■ R • V + sin <p <b + ^ R * V + ^ R • v 

fX fX fX 

From Eq. (26) it follows that 
R • V = 0 

and, rearranging terms, 

(e cos 6) = VT si n <P + ip sin ^ + u ** * ^ 

r MM 

The first term on the right is identically zero, so that, finally 

(e cos 0) = S F • V+&R' V (27) 

v M M 

5. Equation for (e sin 0) 

In analogous fashion 

(e sin 0) = - I ^ R * U + £ F • U 1 (28) 

L|i M j 

6. Equation for t Q 

In order to simplify the derivation of t Q , Kepler time derivatives will be 
assumed to have been canceled. In addition, in order to simplify the writing, 

e cos f = p e cos 0 = P Q (29) 

e sin f = q - e sin 0 = q Q 

* Starting from Eq. (15) 
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_1 r V 1 "® 2 — f 

n (t-t 0 ) = 2 tan | 

^ cos ^ (1+p) + q sin % J 


-7l-e 2 ( 


q q o S 


v i+p i+p Q y 


one obtains, on differentiating with respect to time. 


n (t-t Q ) -n t Q =■ 


(1-e 2 ) sin 2 % 


A(. 

dtv 


-f7- 


cos ^(1+p) + q sin^ 


nr) 

in si 


1 + T 


j^cos ;|(l+p) +q sin^ 

q 


<2 


(30) 


vi+p i+p n y dt W 1 e y V 1 dtvi+p i+p y 


a. The first term on the right of Eq. (30) becomes 


sin | (cos| (1+p) + a sin^) Q 

, 1-?) - ^'l-e 2 sin & (jp cos^ + Q sin j 

i cos ^ (1+p) + q sin | | 

2 / - 2. . 2 (rt 

+ (l-e ) sm g 


9 0 0 

The denominator, by making use of the identity p +q = e , is 


( 1 + p ) ( 1 + p Q ) 

b. The last term in Eq. (30) is 

_ /i_ e 2 ( Q ( 1 + P) ~ qP _ (1 + P o* ~ Po % 

^ (1 + p ) 2 ( 1 + P G ) 2 

c. The right hand side of Eq. (30) thus becomes 


r — 5 j2 sin f (poos f+qsinf) t, (1 + p) -q i. d + P„> - !>„% 

' v 1 — ^2 1 

±f / 1 _ e 2' ) 

+ (f'+^T(r+p o) { 2 sin 2 ! .( 1+ P )cos f + q sin f]-q( 1 +P 0 ) + q o( 1+ P) 


3^1 



_d [7~T = _ (p p + q^) 

dt V 1 e > — ^ 


1 -e 


The coefficient of this derivative becomes, after some algebraic manipulations, 
(1 - e 2 ) sin<p and one gets for the whole equation, 

_ (1 2 S (pp+qb) sinp + p since + (1 - cos (g) q 

n (t-t 0 ) - nt 0 "v 1 e 1 (1 + p) ( 1 + P Q ) 


. • b (1 + p ) - p q 

+ (1 + p) k ~ qp _ v F o ; p o O I 

(1 + p) 2 (1 + P n ) 2 


e. Using the fact that 


P o = p cos <p + q sin c p 
q Q = - p sin ip + q cos <p 
p Q = p cos <p + q sin c p 
b = - p sin io + q cos c p 


b Q (1 + p Q ) - P 0 b 0 (p sin<p - q cosep) (1 + P Q ) + (pcoS(p+ q sin<p)q o 


(1 + P/ 


(i + P 0 r 


and rearranging, one gets 


h (t-t n ) - nt fl = 


: - V 1 -* 


•»' , 1 + p) 2 (l + p o 2 ) 


|pi (p sin<p + sintp) (1 + p) (1 + P Q ) 


- q(l + p o ) 2 + (sin <p (l+p Q ) + q Q coS(p)(l + p) 2 


+ q £(qsin<p + 1- cos<p) (1 + p) (1 + P 0 ) + ( 1+ P) ( 1 + P Q ) 2 

+ (Q 0 sin <p - cos <p (1 + P Q ) ) (1 + P) 2 ] } ( 31 ) 
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f. Noting that 


P =Vi 

q-_g R * R 


pr 

U r 


P-**(l + P> 

4 = f-q + ^~ +E) R * F 
g g 

(32) 


it is now possible to express Eq. (31) in a more convenient form. First, the 
right-hand (R. H. ) side of Eq. (31) is rearranged as follows: 


- ,/1-e 


R.H. = 


it 


(1 - +p) 2 (1 2 } tg (1+p) L 2 Sin <°(l + P) < 1 + p o> - Q < 1+ Po> 

r 2 ' N 

+ 2 '..sin <p (1 +P Q ) + q Q cos <p) (1 + p) + q (1 + p Q ) (q sin cp + 1 - cos 0 ) 
+ ( v qq o sin<p -q cos<p (1 + P Q )J (1 + P) J 


+ (1 + P) 


2R-F 


(q sin a + 1 - cos <p) (1 + p ) + (1 + p ) 2 


+ (q,, sin<p - cos <p (1 + p Q ) ) (1 + p) j } 


1) The coefficient of g is rewritten as follows: 


2 sin<p (1 + p) 2 (l+p Q )+2 (sin<p (l + p Q ) + cos <p) (1 + p) 2 \qq Q sin<p 

'\ 

- q cos <p (1 + p Q )j (1 + p) 
o 

- (! + P Q ) (Q + qp Q - q sin <p - q + q cos <p) 

Replacing p Q by p cos (p + q sin <p, one can factor out (1 + p) from the whole 
expression and rearranging again. 
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(1 + p) {2 (1 + P Q ) sin to (2 + p) + ( 1+ P 0 ) (2 P sin<p - 2qcoscp) + 2(l + p)q Q cos <p 


+ q q Q sm <0 


But 


and 


- p sin <p + q cos <p — q Q 


q sin <p + p cos <p - p 
thus by Eq. (34) 


(33) 


(34) 


(l + p){2 (1 + P 0 ) sin <p (2 + p) -2(l+p o ) q Q + 2 (1 + p) q Q cos <p + qq Q sin<p} 


= (1 + p) I 2 \sincp (l+p o ) (2 + p) - q Q (l-cos<p)j - q q Q sincp 


2) The coefficient of R • F is rewritten as follows: 

p 1 

2 ( 1 +p ) (qsin<p+ l-coscp) + (l+p Q ) [ 1 + P 0 - qsintp - 1+ cos<p - cos <p-pcos<pJ 
+ (1 + P) % sin <p 

By using Eq. (34) this expression reduces to 

2(1 + p ) (q sin <p + 1 - cos tp) + q Q sin <p (1 + p) 

Putting the last two results together, one gets 
1 

I T 

- ,/ 1 -e^ r - n 

R.H. =— 5 — j& 2 {sin tp (1 + p ) (2 + p) - q (l-cos<p)j -qq Q sin <p] 

(1 + P 0 ) g 

+ R • F j^2 (l + p Q ) (q sincp + 1 - cos (p) + (1+p) q Q sin(pj T 
g. h (t-t Q ) - n (t Q ) 
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Noting that 


B - Hl 

n V 3 
a 


. _ 3 „ k 

n -2 n a 


• • 


1 _ 2 R • R 
a r fj, 


2 a 2 * 

a = R • F 


thus 


and 


n = - 3 — R • F 


3a 


n(t-t 0 )-n(t 0 ) = -n|_^R* F (t-t Q ) + t Q | 


M 

If both sides of Eq. (31) are then divided by n, the coefficient of R. H. becomes 


1 - e‘ 


gn(!L + P 0 ) 


but 


I 

V 


1 - e* 


g -/jla 
and the coefficient becomes 


/i \ 2 V 2 , .2 M 

( 1+ P 0 ) H a < 1+ P 0 ) 


Thus, Eq. (31) becomes finally 


V7r R ' F (t_t 0 ) = M (l+p ) 2 {&[ 2 £ sin <P < 1 + P 0 } ( 2+ P) -% (1-cos <p)} - qq^j Sin<p] 


+ (R* F) j^2 (1+P Q ) (q sin <p + 1 - cos <p) + (1+p) q Q sin<p J | (35) 
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IV - APPLICATION OF THE PERTURBATION EQUATIONS 
TO THE POLAR OBLATENESS PROBLEM 

For this problem, F = Vi* where 



(36) 


and where k is the coefficient of the second spherical harmonic due to the 
oblateness of the earth. Then 


F = - 


3juk, 


“ iC 1 ■ 5 \) r+ 2 z k 


By Euler's theorem 


R • F = 



(37) 


(38) 


Since all the expressions on the right side of the perturbation equations are ex- 
pressed more simply in terms of <p than in terms of t, derivatives with respect 
to t will be replaced by derivatives with respect to <p. For this purpose relation 
(14) is used, i. e. , 



from which one obtains 

_d_= dt d__ C (39) 

dp d<p dt V g/ dt 

The right-hand sides are expanded in terms of trigonometric polynomials in 
multiples of <p with functions of the parameters as coefficients. 

For purposes of integration the perturbation equations are all written as 
the sum of two parts, the first of which, indicated by a subscript H will be integrat- 
ed "exactly" while the second part, indicated by a subscript S, contains short period 
terms only. 
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The perturbation equations are as follow: 
1) (e cos 0) ' = (e cos 0) H ' + (e cos 0) g ' 


where 


(e cos 


2 

m '=i ahrii 2 2 X / 

a o ( u q + V, ) - 1 


H g 4 L 2 ' 3 *3 


(1 + 2 e cos f) sin <p 


(40a) - 


=»A 


(e cos 0) g ' = - 4 


g 


{ 12 2 1 
“ 4 ^ u 3 “ v 3 ) ( 7 sin 3 co + sin <p) + ^ UgVg (7 cos 3<p + cos <o) 


LI 2 2 -j 

~2^ U 3 ~ v 3 ) (3 sin 4<p + 5 sin 2<p) + UgVg (3 cos 4<o+ 5 cos2<p) | 


L ^3 2 2 -i 

2 ^ U 3 “ v 3 ) ( cos 4 <P " cos 2 <p) + 3u 3 v 3 ( sin 4 co - sin 2 <o)J 


+ (e sin 0) (e cos 0) |_ - 2"(cos3 <p - cos <p) + (-g- cos 5 <p + ^cos 3 <o-^cos(p)u. 


(40b) 


2 , 5 


+ V § cos 5<p + geos 3(o + Jcosco)+u 3 v 3 (| sin5 <p+| sin 3cp-|sin<p) 


+ (e cos 0) 2 (sin 3 <o + sin cp) - (5 sin 5 co + 23 sin 3 cp + 18 sin <p) Ug 2 

1 o U Q V Q 

+ 16 (5 sin 5 <0+11 sin 3 p+ 6 sin <p) Vg +— g — (5cos5<p+17 cos 3 <p + 10 cos <p) | 

2 

2 r i 

+ (e sin 0) ^ (sin 3 <p - 3 sin co) + -jg- (5 sin 5 <p - 5 sin 3 <p - 10 sin <p) 

2 

V 3 . 

+ jg ( - 5 sin 5 <o+ 17 sin 3 <p- 26 sin <p) + — g — (- 5 cos 5<p+ 11 cos 3 <p 

- 6 cos <p) | } 

2) (e sin 0)' - (e sin 0)^' + (e sin 0) g ' 

where 


3^7 



(e sin 0) H ' = — ^ [| (Ug 2 + Vg 2 ) - 1 (1 + 2 e cos f) cos cp (41a) 

g 4 

_ 2 , 

jU f 1 o O 1 . 

(esin0) ' = - W(u -v 3 ) (7 cos 3 cp - cos <p) - g UgVg (7 sin 3 cp - sin cp) 


3 U iCg | 

"3 , 2 

4 1 

J2 U 3 

g 


_ 3 A 2 f 

1/ 2 

4 ^ 

’ 4 U 3 

g 


+ e cos 0 1 

~2 (U 3 

+ esin0^ 

1 . 2 
"2 U 3 


+ e cos 0 ^ - 1 (Ug 2 - Vg 2 ) (cos 4 cp + cos 2 cp) - 3UgVg (sin 4<p + sin 2 <p) 

+ esin0^-^ (Ug 2 - Vg 2 )(3 sin4<p- 5sin2 <p) + UgVg(3 cos 4(p -5 cos 2 cp) 

2 

ri u q 

+ (e sin 0) (e cos 0) |^j(sin 3 cp + sin cp) + — g- (-5 sin 5cp - 3 sin 3 cp + 2 sin cp) 

v 2 U 3 V 3 J 

+ — jjp (5 sin 5cp-9sin 3cp-14 sin cp) + ~ g — (5 cos 5 cp - 3 cos 3cp - 10 cos cp)J 

2 

nri U <5 

+ (e cos 6) I (cos 3 cp + 3 cos cp) “"Jg" (5 cos 5 cp + 17 cos 3 cp - 26 cos cp) 

v 2 u v 1 

+ -jg- (5 cos 5 cp + 5 cos 3 cp - 10 cos cp) “~~g — (5 sin 5 cp+ H sin 3 cp+ 6 sin cp) J 

2 

+ (e sin 0) 2 jj- ^(cos 3 cp - cos cp)+~ ^g (5 cos 5 cp - 11 cos 3 cp + 6 cos cp) 

v , 2 u V 

+ -^- (- 5 cos 5 <P+ 23 cos 3 cp - 18 cos <p) + — g — (5 sin 5 cp - 17 sin 3 cp + 10 sin cp 
Next, the perturbation equations for the components of U and V are given. 

3) ( Ul )' = + OVg' 


0 fJL kg gg C gg Sq 

( u i) H ' = ~r~ sin <py 1<- V 2 +u iT ) cos< p + (u 2 + v i Y } sin ^ J 

g 


(42a) 


3^8 


4) 


6 A 2 go rs gox ^ gqN . 

(U 1 } S 4 ( ecosf )Y sm <Pl ( C V 2 + u io7‘y >cos < P + V U 2 + V li _ y sin n 


< u 2 } ' = (Vh' + (u 2 ) s' 


6 A, 


^ U 2^H ' 4 

g 


g 3 

T 


f C g 3"\ ( 

' ®3 > N 

lVl +u 2 ^ cos< P + V 

.‘ U l + V 2 g ) 


(43a) 


6^i kg g„ r Z' So \ Z' gq"\ "I 

^ u 2^s = 4 {e cos f) y shi p »> W + u 2 gV cos ^ + v u i + v 2 i"y sin n 

g 


5) (u 3 ) ( u 3 >h / + ^ U 3^S ' 

6 M 2 k 2 ^g 3 ^2 

( U 3>H 4 — v7y sin ( u 3 cos <P + Vg sin <p) 


(44a) 


k 2/%\ 2 

(u 3 ) s ' = — 4 — \Tgy ( e cos f ) sin <° ( U 3 cos <P + v 3 sin <p) 

g 


(44b) 


6) 


< v l> -< v 1 >h' + < v 1 >s' 


6u 2 k 2 gg r g 3 x r g,N <1 

( v i)u a — c °s<o— 1^- v 2 +u iY J COS( p + \} 1 2 + v iY y sin< Pj ( 45a > 


l'H 4 

g 

6,/k 


bfi g . . ^ 

7 = " ~T~ < e cos f ) Y COS ^ 1C" v 2 + U 1 ~) COS <f> + C U 2 + V 1 ^9 sin <p} 


(42b) 


(43b) 


(45b) 
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( v 2^ *Vh + ^Vs 


6 A? 


(V 2 ) H , = -^ _C0S<P T 1 ‘-1 +U 2 

g 

Gn\ g 3 (S 
{v 2 ) s ' = ~ — 4- e cosf— cos cp^v 


M( v l +u 2?) COS<P+ C" U l +V 2^) Sin<P } (46a) 


1 +u 2 ^- cos + (^- u 1 + v 2 ~g \) sin <p} ( 46b ) 


< v 3 > # = ^ V 3 ) H ' + ^ V 3 ) S ' 

6ju 2 k ^g ^2 . 

(v 3 } H 7 = " ~T~ W COS (U 3 COS ^ + V 3 Sm <p) 
g 


6u\ 


v ^ g . 2 

(v 3 )g ' = ^ (e COS f)(-f j cos <p < u 3 cos <p + v 3 sin <P> 


(47 a) 


(47b) 


9) gV = g H ' + 8s 


g ' = - 6 fi 2 k (v 3 cos cp - u 3 sin <p) (Ug cos <p + Vg sin cp) 


(48a) 


g 7 = - 6 p 2 kg e cos f (Vg cos <p - Ug sin cp) (u g cos cp + v g sin cp) (48b) 


10) t f 


t 0 +— i(t-t 0 ) + — $= — 2^{LI (U 3 2 + V 3 2 

0 u 0 V (l+ecos0) 2 g 3LL2 3 3 


)-l](l-e 2 ) 


- / o 17 2 3 3^ 

+ coscp UgVg^^esinS + ^-esinGecosQ+^esinS (ecos0) +-£ (e sin 0) j 

- < U 3 2 - v 3 2 )(l + f e COS 0 + 8 (e COS 0)2 + l (e C0S 6)3 + 1 (e Sin 9)2 ) 


(49) 

(coni' d on next page) 
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+(§ ( V v 3> - v Q ‘ 2 - ' e cos e + \ (e c °s 0) 2 - |(e cos 0) 3 

- ^ (e sin 0) 2 - e cos 0 (e sin 0) 2 ^) ~\ 

+ cos 2 <pL u 3 v 3 (| e sin 0+| e cos 0e sin 0)+ (Ug 2 -Vg 2 ) cos 0 -|(e cos 0) 

+ | ( e sin 0J 2 ) - (^| (Ug 2 + Vg 2 )- l)(e cos 0 + (e cos 0) 2 
+ ^(6 8^ 0) 2 ^) 

r 39 i 9 11 on 

+ cos 3<p [^Ug Vg v - 2 e sin 0 + -g- e sin 0 e cos 0 + g e sin 0 (e cos 0) — ~ (e sin 0)J 

+ (Ug 2 Vg 2 )(^2 + If e cos ® ~ £ ( e cos 9) 2 + (e sin 0) 2 + -| (e cos 0) 3 

+ e cos 0 (e sin 0) 2 ^)-^|- (Ug 2 +Vg 2 ) - l) 

- !assj) 2 e cos e (e sin 9) 2)J 


+ cos 4<pJ^- UgVg e sin 0 +^e sin 0 e cos 0) + 3 '( u 3 2-v 3 2 ) C 2 e cos 0 + 2 (e cos 0) z 

- (e sin 0) 2 ^) j 


+ cos ! 


5(p[^- 3 3 (j- 21 e cos 0 e sin 0 - 26 e sin 0 (e cos 0) 2 + 5(e sin 0) 3 ^} 


+ ia 2_ v 2^ec^ +i eossef_ | e cos e (e sin e) 2 _| (e sin0) 2) j 

+ sin<pj^-UgVg(l + ^ ecos 0+|- (e cos 0) 2 +^ (e sin 0) 2 ecos 0(e sin©) 2 ) 

2 2^ 1 1 21 o\ 

+ (Ug -Vg )l v esin0-— e sin0e cos 0-g- esin0 (ecos0) -^-(e sinO)* 3 ^) 

(49) 

(cont'd on next page) 



+ (| (u 2 + v 3 2 )-i)( e sin 0 e cos 0 - i e sin 0 (e cos 0) 2 -|(e sin 0) 3 ) j 


+ sin 2<p[-3 u 3 v 3 (^1 " ( e sin 0) 2 - (e cos 0) 2 - \e sin 0 e cos o) 


- 5( u g 2 " v 3 2 ) (® sin 9 + \ e sin 0 e cos ©J - (2 e sin 0 + 3 e sin 0 e cos 0) 

C| (u 3 2+v 3 2)-1 ). 

+ sin 3 (o [ U 3 V 3 C 7 + ^ e cos 0 + ^ (e sin 0) 2 - (e cos 0) 2 + 1 (e cos 0) 3 

+ \ e cos 0 (e si? 0) 2 )+ (Ug 2 -Vg 2 )(e sin 0 + (e sin 0) 3 - jje sin 0 (e cos 0) 2 
- ^ e sin 0 e cos 9 ) + ^ U 3 2+V 3 2 ^“0C“ e sin ® e cos 0 

+ -| (e sin 0) 3 e sin 0 (e cos 0) 2 J j 


r s o 2 3 

+ sin4(pj^u 3 v 3 < v 6ecos0+6(ecos0) -3(esin0) -^esin0ecos0 

+ (Ug 2 -v 3 2 ) (j3 e sin 0 + | e sin 0 e cos fi)j 


+ sin 5<p [ 5 u 3 v 3 Q- (e cos 0) 2 + ^ (e cos 0) 3 - 1 (e sin 0) 2 - \ e cos 0 (e sin 0 )' 


5 (u 3 2_v 3 2) Q e sin 0e cos 0+ e sin 0(e cos 0) 2 - j|-(esin0) 3 ) 
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V - FIRST ORDER SOLUTION OF THE 
POLAR OBLATENESS PERTURBATION EQUATIONS 

In solving the perturbation equations derived in the preceding section, 1 3 
general procedure is to solve exactly for as large a piece of the equation as 
possible, assuming that the parameters appearing in that piece are constant 
(except that parameter for which one solves in that particular equation). * The 
remaining terms are then integrated holding all the parameters cons tant and 
the results are added to the solutions obtained in the first step. This procedure 
is justified because it is equivalent to a second application of the variation-of- 
parameters method in which only first order terms are retained. Thus in the 
perturbation equation for a parameter x, that part, labeled x H ' in the preced- 
ing section, is the piece of the equation that can be solved exactly under the 
restrictions mentioned above. The remaining part of the equation, which is 

integrated keeping all the parameters constant, was labeled x ' in the preced- 

b 

ing section. 

The equations to be solved to obtain e cos 0, e sin 0, u , u„, u„, v„ v 

1 A U 1 6 

and Vg are Eqs. (40 through 47). Ctf these equations, those lettered "a" may be 
divided into the sets of coupled equations j^(40a) and (41a)] , j (44a) and (47a) j 
sftd (42a), (43a), (45a) and (46a) . These sets are solved by standard methods 
with the following restriction: In each system of equations, those parameters 
on the R. H. S. (right-hand side) which do not appear on the L. H. S. are kept con- 
stant. Thus, for example, in the system of Eqs. (40a) and (41a) the parameters 
g, u 3 and v 3 appearing on the R.H.S. are held constant on solving this system. 

The first order solutions are then the following: 


* Although the method employed here and that prescribed by the method of 
averages have different theoretical justifications, the application of the two 
methods requires the solution of equations which appear to be quite similar. ® 
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where e 


A 




where 


(52) 


a c~" <1*2! + 25)cos lX 1 l(p.+ (|X 1 ?+2 J 8) cos l^l <p-(|X 4 |-28)cos | Xg |<p 
+ (IX 3 | - 2/8) cos 1X 4 | <p 


a s ‘ (!X 2l + 25)sin l X il<P + ((X 1 1 + 2S) sin ! X 2 1 <p - ( | X 4 1 - 25) sin j Xg ! <p 
+ ( 1 X 3 U- 2/3) sin |X 4 |<p 

b s = (!X 2 !-2a)sln |X 1 |<p-(|X 1 |-2^sin|X 2 |<p-(!X 4 |-2 a )sin | Xg | <p 



+ 


(IXgl -2a) sin |X 4 | cp 


b c = (|X 2 !-2a)cos|X 1 !<p-(|X 1 |-2 0! )cos |X 2 | <p- ( IX 4 ! - 2«) cos ! Xg | <p 

+ (|Xg| - 2a) cos |X 4 1 cp (53) 

(cont’d on next page) 
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c « - ( 1 X 21 + 218)008 IXJ^+OXjJ +28) cos |X 2 |cp + (|\ 4 | -28) cos |X 3 j<p 
- (|X 3 | - 26) cos jX 4 j <p 

Cg = - (|X 2 1 + 2^3) sin ! X 1 j(p + (|X 1 j+28) sin ^ | cp + (|X 4 l - 2^) sin jXgj <p 


- ( 1 X 3 1 - 26) sin |X 4 1 <p 


(53) cont'd 


d =-(|X 9 ^)coslX 1 !<p + (lX 1 l-2 a )coslX 2 l<p-(lX 4 l-2 ( y)cos |Xg| cp 

C “ 


+ (jX 3 ! - 2o) cos IX 4 1 <p 

d s= - ((X 2 1 - 2a) sin |X 1 |<p+ (|X 4 j -2o)sin iX^tp-^Xj -2a) sin | Xg | <p 
+ (|Xg! - 2a) Sin |X 4 | <p 


and 


a- € 


£3 

®o 


6 = e 



I = a + 1 +'Va +2/3 + 1 


IV 


= a + 1 ~ + 2 8 + 1 


= (a - 1) + v a +2/3+1 


U 4 | - (a- 1) -V a ‘ + 20+1 




The absolute value signs used here 
indicate that a factor i is omitted 
from the X's which are the character- 
istic roots for the system of Eqs. 
(42a, 43a, 45a, 46a). 


The equations to be solved to obtain g are Eqs. (48a) and (48b). The R. H. S. 
of Eq. (48a) is a perfect differential if one takes into account the equations for 
u_ / and v ' (Eqs. 44a, 44b, 47a, 47b). In Eq. (48b) the parameters appearing 

O ' u 

in the R. H. S. are given their initial values and one obtains finally the following 
expression for g 
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(54) 



COS <p + Vg 



The equation to be solved to obtain t Q is Eq. (49). The L. H. S. of this 
equation is a perfect differential, provided one assumes that a is a constant, 
a Q , and that the t Q occurring in the second term on the L. H. S. is also a con- 
stant. In a first order solution this is justified. Thus, the L.H.S. of the 
equation integrates to 

3a 0 

t 0 w ♦ — * <t - 1 00 > 


In the R. H. S. of the equation, the parameters are assumed to have their initial 
values and the integration is performed with respect to <p. Thus 


*0^ *00 " ju ^ W r 


3k„ 


_l+(e cos 0) o ] 2 g o 


[ I < U 30 2 + v 30 2 > - X ] <P 


(55) 


+ integral of other terms. 


The limits of integration are zero and<p. 

SECTION VI - DISCUSSION OF THE RESULTS AND APPLICATIONS 


Now that the perturbation equations and their first order solutions are avail- 
able for examination, some distinctive features of the parameters become apparent. 
It has already been noted that the parameters U and V are perpendicular unit vectors 
which are to be regarded as rigidly attached to the angular momentum vector G 
throughout the motion. These parameters thus differ in an essential way from any 
of the conventional sets of parameters such as the Delaunay elements or initial 
conditions, because to relate the initial values of the parameters with their values 
at time t requires knowledge not only of the position and velocity initially and at 
time t, but also a knowledge of the trajectory between these times. For any con- 
ventional set of elements, on the other hand, knowledge of the initial and terminal 
conditions is sufficient to determine the initial and terminal values of the elements. 

It may thus appear at first sight that the elements used in this report involve 
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complications that are not present in the use of conventional elements. It must, 
however, also be recalled that the present elements are so defined that the 
independent variable <p has no perturbation derivative, while with conventional 
elements the independent variable, usually either true or mean anomaly, does 
have a perturbation derivative, which introduces complications in the derivation, 
and integration of the perturbation equations. Further, even though the present 
elements are functions of the trajectory (and hence of the particular perturbing 
function used), once the perturbation equations have been integrated the fact that 
the solution of these equations must be used to determine the elements poses no 
fundamental problem. 

In this report only the first order solution of the perturbation equations has 
been presented. To obtain the second order solution Eqs. (40) to (49) are integrated 
again replacing those parameters held constant in the first order integration by 
their first order solutions. The integration of these equations involves a great 
deal of routine trigonometric manipulation and will be the subject of a later report. 
It is, however, possible to state a general conclusion on the results of the integra- 
tion. This conclusion is that the second order terms will be small compared to 
the first order terms for a time of the order of 100 periods. This means that 
for any problem (for which the first order solution has sufficient precision) the 
first order solution is usable and valid for about 100 periods. The reason for 
this is that in the second order solution, terms of the form 

§jg..Ua and 1 - cos_u£. (56) 

e e 

2 

occur with coefficients of the form A kg where A is of order unity. Noting that, 
for any e 


1 - cos e o 


i • 2 

1 sin e <p 


sin e c p 



2 € 




it is evident that no such term can creep into first order so long as 
Ak 2 2 ffl <k 2 
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or 


JO. < -1 

2n 2nAk 2 


-100 


It should be remarked that if e vanishes the first of the terms (56) is secular and 
the second is a constant. It turns out that for at least two particular sets of initial 
conditions there will be secular terms, namely, for initial conditions such that 
the angle of inclination is 63.4° and 67.8°. Thus, critical angles of inclination 
occur in this formulation, but not in the same way as in conventional theories, 
for which only one critical angle has been found. The significant difference is 
that in conventional theories the critical angle appears as a singularity in the 
second order solution, whereas in the present theory the second order solution 
has no singularity, and while it is unbounded in time, it will not affect the first 
order solution for about 100 periods. 


One might inquire what sort of precision can be expected from the first 
order theory. In order to discuss this question, it must first be remarked 
that parameters associated with the Kepler problem may be separated into two 
categories. Parameters such as the semimajor axis, the eccentricity, and the 
angle of inclination, as well as functions of such parameters have only short 
period terms in their first order corrections. Other parameters such as 
argument of perigee conventionally contain not only short period terms but also 
secular terms. No first order secular terms appear in this formulation because 
of the way in which the differential equations (40) to (47) are separated. The closed 
form contribution to the first order solutions obtained in Section IV from Eqs. 
(40a), (41a), . . . (47a) include the analogues to the secular terms as well as such 
short period terms as could be included in the closed form integration. Suppose 
now that one numerically compares trajectory predictions based on the Kepler 
problem, based on the first order solution derived in this report and based on a 
high precision numerical integration. If the comparison is made for <p = 2n, all 
short period terms will disappear. Those parameters involving only short 
period terms should be the same for both the Kepler and the first order predictions 
and should agree to about six significant digits (since k 2 = 10 _ ) with the precision 
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calculation. The remaining parameters should be given to about three more 
significant digits by the first order theory than by the Kepler estimate. If the 
comparison were made on functions of the elements, rather than on the elements 
themselves, one would still expect the first order theory to yield about three 
more significant digits than the Kepler estimate except for functions independent 
of parameters containing secular terms. A comparison, atcp = 2 tt, on position 
and velocity would thus be expected to yield, in general, three more digits from 
the first order estimate than for the Kepler estimate. Preliminary numerical 
comparisons indicate that this is indeed the case. 

The application of the theory developed in this report for prediction is 
fairly direct. To obtain position, velocity and time corresponding to a speci- 
fied p, one simply evaluates the elements from Eqs. (50) - (55), and then sub- 
stitutes in Eqs. (5), (6) and (15). To obtain position and velocity at a specified 
time it is necessary to replace all elements in Eq. (15) except t and by their 
expressions in terms of <p, to obtain a transcendental relation between <p and t. 

The angle <p would then be obtained by numerical solution of this equation. Once 
<p is known, position and velocity are obtained as above. 

The boundary value problem is somewhat more difficult. In this case one 
would require knowledge of seven conditions, some given at the initial point and 
the rest at the terminal point. Now Eqs. (50) to (55) give the parameters as 
functions of <p and Eq. (15) relates <p and t. Eqs. (5) and (6) give position and 
velocity as functions of the parameters. The boundary conditions would thus 
give seven equations for the determination of six independent parameters and 
t. The solution of these equations would have to be carried out numerically 
because of their transcendental character. 

In conclusion, one might comment on some special solutions of the perturba- 
tion equations. If the initial conditions are such that the initial orbit is either 
equatorial or polar the U and V vectors are constants of the motion. The perturba- 

• • go 

tion equations (42) - (47) for U and V contain as a factor on the right hand side. 
For polar orbits g„ vanishes and hence U and V are constant vectors. For an 

O 

equatorial orbit u g = Vg = 9 and hence, again, the right hand side of Eqs. (42) - 
(47) for U and V vanishes. This last result illustrates one advantage of these 
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parameters. The conventional elements include longitude of the node and 
argument of perigee which are not defined for equatorial orbits and hence 
modifications are required for the treatment of this case. 

In Eqs. (40) and (41) the expression (1-3 cos^ i) can be shown to be a 
factor of the right hand side. This factor vanishes for an angle of inclination 
of 54. 74° and hence for an orbit initially at this inclination the eccentricity 
and the parameter 0 (angle between U and perigee) are constants of the motion. 
The critical angles 63. 4° and 67.8° which appear in the second order theory 
have no obvious significance for the parameters used in this report. It is 
curious, however that these three angles have the property that 


respectively. 
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AN EMPIRICAL STUDY OF CONFIDENCE LIMITS 
FOR DESIRED CUTOFF CONDITIONS 

BY 

R. E. WHEELER 
SUMMARY 

A statistical procedure, taking into consideration variations due 
to changes in launch times as well as errors introduced by the path 
adaptive guidance polynomials, was designed to obtain confidence li- 
mits for desired cutoff conditions such as radius, velocity, etc. The 
following upper bounds for 2c r limits were obtained for a given example: 
2100 meters for the radius, 1.615 meters/second for the velocity, 

. 2692 degrees for the flight path angle, and . 026 degrees for the orbi- 
tal inclination. 
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INTRODUCTION 


It is our purpose in this study to design and perform an experiment 
that can be used to obtain confidence limits for desired cutoff conditions 
such as radius, flight path angle, velocity, and orbital inclination. We 
are interested in two results - the design of a statistical procedure that 
could be employed for similar problems and the actual numerical re- 
sults from this particular experiment. The confidence limits obtained 
will be such that they will take into consideration variations in launch 
times across a selected launch window as well as errors introduced 
by the path adaptive guidance polynomials. 

In the example that we considered, a volume of trajectories was 
computed by the Boeing Company using the theory of calculus of vari- 
ations. Each of the trajectories obtained, if flown, would place a 
vehicle at the desired end conditions in an optimum manner. Like- 
wise, multidimensional polynomials were computed by the Boeing 
Company, to fit or approximate the volume of optimum trajectories. 

In our experiment the data consists of results obtained by running on 
a computer actual trajectory simulations with guidance commands 
being provided by the polynomials. These simulations, as run by the 
Boeing Company, used the steering and cutoff polynomials to guide the 
flight. 
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THE DESIGN OF THE EXPERIMENT 


Now in the design of our experiment, instead of selecting a random 
sample from the volume of all optimum trajectories associated with a 
given cutoff condition, we elected to take a sample from a sub- set of 
the universe of all optimum trajectories. 

A description of the sub- set of the universe from which we select- 
ed our sample involves the definition of a nominal trajectory. In the 
process of generating a volume of optimum trajectories a certain op- 
timum trajectory that satisfies selected performance criteria is clas- 
sified as "the nominal trajectory". The universe of trajectories that 
we considered in our experiment was generated by considering varia- 
tions or perturbations of parameters from their values given for the 
nominal trajectory. 

Thus each vehicle parameter and flight parameter was assumed 
to have a nominal value. In a like manner the tolerance or standard 
deviation measuring variation from this nominal value was taken to be 
known for each parameter. It was further supposed that deviation from 
a nominal trajectory could be caused by any one or combination of a 
number of independent error sources or parameters, each of which 
was normally distributed about the nominal value as a mean. 

Under these assumptions the universe that we considered consisted 
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n 


'it r it i i 

of trajectories, T (xi , x 2 , x 3 , . . . x^ ) such that TI : 1 -P(x i <x i <x i ) I 

i= i 

= . 05 where x^ and x" are perturbed values of xi (i = 1, 2, • • • , n) 
such that x* - fix* = (jlxi - xV . This universe or family of trajectories 
we called "5 percent level trajectories". 

For example if a trajectory is to be obtained by the deviation 
of one parameter from a nominal value 1 -P(x"<xi <x j [) - . 05 or 
P(x^ f <xi <x{) = .95. Thus Xj =|jl + 1. 96 cr and x" - \x - 1. 96 tr or xj is the 
sum of the nominal value of the variable and approximately two stan- 
dard deviations of the variable. 

If a trajectory is to be obtained by the deviation of two parameters 

r 'if i 

thenU-P(-x”<xi <xj) j l-P(-x”<x 2 <xpj = .05. An infinite number of 
combinations of xj and x 2 could be assumed to give these trajectories. 
For example if xi and x 2 are nominal values then five possible combi- 


nations are: 


X! + . 6 7 <r xi , x 2 + 1 . 65o- x2 
xi + <r xi , x 2 + 1.4Z<7 x2 

xi + 1.22cr xi , x 2 + 1.22«r 

xi + 1.42or xi , x 2 + cr x2 

xi + 1 . 65o- xl , x 2 + . 67o- x2 

THE VARIANCE OF CUTOFF ERRORS 


Now let us define y to be a variable representing any one of the 
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cutoff conditions: radius, flight path angle , velocity, or orbital inclina- 
tion. Then by the variance of y, < r\ we mean the average of the 
squares of the deviations of y from the desired cutoff condition. 

We observe that, on the average, trajectories formed by small 
deviations of parameters from the nominal, give cutoff errors close 
to those given by the nominal. Since variations in a parameter are 
considered to be normally distributed about the nominal value as a 
mean, a large percentage of the trajectories will have cutoff errors 
about the same size as those for the nominal. A comparison of a 
sample of 5 percent level trajectories with the nominal clearly indi- 
cates that the variance of any cutoff condition for the 5 percent level 
trajectories will be greater than the variance of the universe of all 
optimum trajectories. 

The next step in our experiment consisted of selecting a sample of 
forty-six (46) trajectories from the universe of all 5 percent level 
trajectories. Eleven (11) of these trajectories were generated by de- 
viating one parameter from the nominal. The eleven parameters were 
chosen because they seemed to produce the largest cutoff errors. The 
parameters selected to generate these off-nominal trajectories were: 
stage 1 thrust, specific impulse, and inertia weight; stage Z thrust, 
specific impulse and inertia weight; stage 3 thrust, specific impulse 
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and inertia weight along with head wind and left cross wind. Of these 
eleven parameters the ones that produced the largest errors we„re then 
combined using the probability theory indicated previously. Fifteen 
(15) trajectories were generated by deviating two parameters from 
their nominals such as stage 1 thrust along with stage 2 inertia weight. 
Twenty (20) of the trajectories were generated by deviating simultan- 
eously three parameters from their nominals. As indicated a special 
effort was made in the selection of the parameters and combination of 
parameters to select those that would make as large as possible the 
errors in cutoff conditions. For example in a combination of para- 
meters the direction of the variations were selected so that the result- 
ing errors would be in the same direction. To summarize, the forty- 
six (46) trajectories used in the sample were selected by the Saturn 
Booster Branch of the Boeing Company to be 5 percent level trajectories 
that would produce the largest errors in cutoff conditions. 

Thus 

c 2 (Sample)> cr 2 (5 Percent Level) 

y Y 

o" 2 (5 Percent Level) > cr 2 (Universe) 

y y 

cr 2 (Sample) > <r 2 (Universe) 

y y 

This selection technique as described produces a sample which 
will have a variance that can serve as an upper bound for the variance 
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of the universe of all trajectories generated by allowing parameters 
to assume off-nominal values. We could call this " a 95 percent level 
upper bound" since the probability that parameters will deviate from 
the nominal by more than the parameters used to obtain this upper 
bound in less than .05. 

However, due to the cost involved it was decided that the size of 
the sample was too large. From a careful study of the data a more 
select sample of size 10 was chosen. Once again the parameters that 
were chosen for this sample were selected because they induced the 
largest errors in cutoff conditions. A comparison of the variance of 
the sample of size 10 with the variance of the sample of size 46 is 
given in the following table: 


Cutoff 

Condition 

Si 

(Size 10) 

s| 

(Size 46) 

sf 

F(97. 5%) 

Radius 

1,833, 578 

593, 310 

3. 09 

2. 43 

Velocity 

. 938 

. 3462 

2. 72 

2. 43 

Flight Path 

Angle 

. 02566 

. 0083 

3. 08 

2. 43 

Orbital 

Inclination 

. 0000053 

. 0000166 

. 32 

2. 43 


Under the assumption that the two samples come from populations 
with equal variance there is less than a 2. 5 percent chance of getting 
a variance in a sample of size 10 that deviates as much from the sample 
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of size 46 as the sample selected. This is true for three cutoff condi- 


tions: radius, velocity, and flight path angle. Hence, it can be stated 
that the variance of the selected sample of size 10 is an upper 

bound for the variance of the universe of all trajectories in the volume 
relative to these cutoff conditions. It is probably true that the selected 
sample of size 10 produces an upper bound for the variance of the vari- 
able, orbital inclination. 

The ten optimum trajectories that were selected because they 
seemed to produce extremely large cutoff errors were generated by: 

(1) 2a head wind, (2) 2c r stage two thrust, (3) 2c r stage two inertia 
weight, (4) -1. 2a stage two thrust along with 1. 2cr stage two inertia 
weight, (5) 1. 2cr head wind along with 1. 2a stage two inertia weight, 

(6) 1. 2a stage two inertia weight combined with 1. 2a stage three iner- 
tia weight, (7) 1.2a head wind and a -1. 2a stage two thrust, (8) . 9a 
head wind, -.9<r stage two thrust, and .9a stage two inertia weight, 

(9) . 9<r head wind, .9^ stage two inertia weight, and a . 9cr stage three 
inertia weight, (10) -.9<r stage two thrust, .9a stage two inertia weight 
and a . 9o" stage three inertia weight. 

VARIATIONS IN LAUNCH TIME 

Note that in all the precedirg discussion, we have considered a 
fixed launch time. Let us now extend our sampling process to the 
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entire launch window. We will design our procedure to involve running 
simulated flights at seven different launch times in the launch window; 

_30, - 20 , -10, 0 , 10, 20 , 30 minutes. The results of the data 
obtained from this experiment will be used to construct upper bounds 
for confidence limits for the errors of a given cutoff condition. The 
confidence limits obtained will cover the complete launch window and 
include all possible trajectories that would be included by chance 95 
percent of the time. 

The variance of the seven samples taken at the seven different 
launch times will be pooled or averaged together to give an upper bound 
for the variance of the universe of all optimum trajectories throughout 
the launch window. We make use of the following formula for this 
analysis. 

2 _ (ni- 1 ) si + (n 2 - 1 ) S 2 + • . . (ng -) s? 

S? m + n 2 + . . . + 117 - 7 


Using S 7 as an upper bound for the variance of the universe of all 


optimum trajectories, the following upper bounds are obtained for 2 cr . 


Cutoff Condition 
Radius 
V elocity 

Flight Path Angle 
Orbital Inclination 


Upper Bound for 2 cr 
2656 mete rs 
1.8544 meters/sec 
. 3142 degrees 
. 0458 degrees 
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The process by which we selected our samples indicates that a 
somewhat smaller upper bound can be obtained by using statistical 
theory utilizing the range. At the same time it is evident that the 
range of our selected sample of size ten at a given launch time is un- 
doubtedly an upper bound for the range of random sample of size ten 
from the universe of all trajectories at a given launch time. 

Thus, let us assume that we are taking random samples of size 
10 from a universe made up of optimum trajectories generated by 
allowing the vehicle and flight parameters to vary. Since our samples 
are small (of size 10) the range and standard deviation of a sample 
are likely to fluctuate together. Thus the range may be used to esti- 
mate variance with little loss of efficiency. Once again, assuming 
that the errors for each variable are normally distributed, we utilize 
tabulated tables for the w distribution where w = R / cr. 

To estimate the standard deviation of the universe we calculate 
the average range of the 7 samples. Call this value, R. For samples 

of size 10 the expected value of w is 3. 078. Thus an estimate of the 

R 

standard deviation of the universe is given as — — — — . In other words 

3.078 

to take R to be an estimate of the mean value of the range of all samples 
of isize 10 is the equivalent of taking the standard deviation of the uni- 
verse to be R/ 3. 078 . Since the range for each of the selected samples 
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at different launch times is an upper 


bound for the range of random 


samples at these times, then 

~ R 

E(<x) _ _JL (Random Samples)< — (Selected Samples) 

' ' 3. 078 S. U (a 

Thus R / 3. 078 for our selected samples gives an upper bound for cr of 


the universe. 

Cutoff Condition 

Radius 

Velocity 

Flight Path Angle 
Orbital Inclination 


Upper Bound for 2cr 

Z100 meters 
1.615 meters/sec 
. 2692 degrees 
. 026 degrees 


VARIABILITY TESTS 

In the preceding theory we have made the basic assumption that 
the variability of output errors remains constant during changes of 
launch window. In particular we have made this assumption for the 
universe from which we picked out "selected sample". We wish now 
to test whether or not the variability of output errors remains constant 
across the launch window. As a word of caution it should be remem- 
bered that if the variation as indicated by the samples should prove to 
be significant it could be due to departure from normality within the 
groups rather than departure from heterogeneity. 

2 

m max Si 

Two tests will be made for each variable. The — — r will afford 
a quick test for comparing the variance estimates. Cochran's Tests 
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„ , . r . max Si , 

for the homogeneity of variance — — — z — tests whether one variance is 

" s i 

significantly larger than the others. 


Variable 

Max Si 
Min S* 

5% Point 

Result 

Max S| 

ssi 

5% Point 

Result 

r 

1.06 

.7.42 

not 

significant 

.15 

. 315 

not 

significant 

9 

1. 59 

7.42 

not 

significant 

. 156 

. 315 

not 

significant 

V 

1. 17 

7.42 

not 

significant 

. 159 

. 315 

not 

significant 

i 

16. 2 

7.42 

significant 

. 54 

. 315 

significant 
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DERIVATION OF A GUIDANCE FUNCTION MODEL 

BY 

R. E. WHEELER 
SUMMARY 

This report presents the derivation of a mathematical model for 
fitting the steering function. The solution gives cot 6 (0 = +cj> where i)j 
is the steering angle and 4> = arc tan ^ ) as a function of time and state 
variables along the trajectory. This function evaluated at t = t Q should 
be the desired steering function. No end conditions were considered as 
all constants of integration were combined with unknown constants in 
the power series expansion; it is proposed that curve fitting techniques 
will be used to obtain these constants. 


i 
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INTRODUCTION 


This paper is concerned with the development of a mathematical 
model that may be used as a guidance function. The basic requirement 
of a guidance function is that it instantly converts sensed state varia- 
bles of the vehicle into command signals to enable the vehicle to follow 
a newly selected optimal path. 

At this time multivariate polynomials are being used to express 
the guidance parameters in terms of the state and performance varia- 
bles of the vehicle. From all reports these polynomials seem to be 
adequate to represent the problem encountered at this time. However, 
at several meetings of those concerned with this phase of the guidance 
problem, opinions have been expressed that some other type function 
might better represent the relationship between guidance parameters 
and state variables. 

So this research was motivated by the problem of trying to develop, 

if possible, a function or form of a function that would represent the 

relationship between the state variables and guidance parameters. The 

model that will be obtained will have three important properties. First 

of all its form will not be assumed in any way. Secondly, the functional 

relationship will be developed from the equations that define the motion 

and the conditions that insure an optimum trajectory. Finally, the 

relationship will contain a number of undetermined coefficients which 
will need to be obtained by some method of curve fitting techniques. 
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THE PROBLEM CONCEPT 


In this paper a mathematical pnodel defining the steering function 
in terms of instantaneous state variables is developed for the powered 
flight problem defined as follows: 

1. Motion is assumed to occur in a vacuum. 

2. Only two dimensional motion is considered. 

3. Rigid body dynamics is neglected. 

4. The earth is assumed to be spherical and homogeneous. 

5. A constant applied force (F) is considered. 

6. The time rate of change of the mass (m) of the vehicle is 
constant. 

Langrangian Multipliers are used to formulate necessary conditions 
for extremizing some variable such as propellant consumption or burn- 
ing time. 

Equations Defining the Problem 

The differential equations which define the motion of the vehicle 
may be written as: 


F , kx 

x = — sin ip - — r 

m r J 

F ky 

y = — cos w - — T 

} m r 3 


( 1 ) 


The coordinate system, x, y is chosen so that x is parallel to the 
surface of the earth, and y is perpendicular to the surface. The dot 
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represents differentiation with respect to time. F is the thrust magni- 
tude which we assume to be constant. Likewise, k is considered as a 


given constant. The mass, m, is of the form m + m t where m and 

o o o 

m are considered as constants. The control variable is di , the direc- 
o x 

tion of the thrust vector measured positive from the upward vertical. 
Now consider the change of variables defined by 
x = r cos (]) xj=r x 3 = r 

y = r sin <j> x 2 - r cf> 0 = \\t + cj> 

Under this transformation equations (1) become 


( 2 ) 


xi = 


2 

X Z 

X 3 


k ^ F . „ 

-2 + — sm 0 
x 3 m 


xi x 2 , F 

x 2 - - + — cos 0 

x 3 m 


( 3 ) 


The function whose time integral is to be extremized may be defined 
as G = 1 + ZJ.X.g. where X. are the undetermined Lagrangian Multipliers, 


and 


x 2 k F . 

gi = xi - + “2 - — sm 0 = 0 

x 3 x 3 m 

g 2 = x 2 + -"- 2 " - — cos 0=0 

x 3 m 


g 3 = x 3 - xi = 0 


( 4 ) 


g4 

g5 


k4 = 0 

X 3 

m - m 0 = 0 
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Applying the Euler- Lagrange conditions to the function G results in 


the following system of equations: 


Xl = 

x 2 k 

— - "7 


sin 0 



( 5 ) 

x 3 x 3 

m 





X 2 = 

-xi x 2 

F 

+ — 

cos 0 



( 6 ) 

x 3 

m 





hi = 

h 2 

- x 3 




(7) 

x 3 






h 2 = 

-Zx 2 

Xi + 

^ h 2 

1 

- - h 4 


( 8 ) 

X 3 


x 3 

X3 



u = 

[l^j 2 

2 k 1 

Xi 

Xl X 2 , 

2 h 2 

+ ^ h 4 

( 9 ) 


jjx 3 l 

x! J 


X? 

x 3 


*4 = 

0 





(10) 

h 5 = 

F 

— 2 sin 0 Xi 

F 

+ 2 1 

cos 0 h 2 


(ID 

m 


m 




F 

- — cos 8 Xi 

+ *: 

sin 0 X 2 

= 0 


(12) 

m 


m 




Since G is 

; explic 

itly independent of the independent variable, 

t, a 


first integral of the system can be shown to be 


hi xi + h 2 x 2 + h 3 x 3 + h 4 X4 + h 5 m - ci ( 13 ) 

F 

By substituting in equation ( 11 ) the values given for — sin 0 and 

P 

— cos Gin equations ( 5 ) and (6) the following relationship is obtained: 
m 

\i xi + X 2 x 2 “ ~ Xi + M “ ^2 = m X5 ( 14 ) 

x 3 x 3 x 3 

By eliminating Xi xi + X 2 x 2 between equations ( 13 ) and ( 14 ) the result 
may be expressed as 

— hi - — z hi - Xl — — + xi X 3 + X4 h4 + mhs + mis' = ci ( 15 ) 
x 3 x 3 x 3 
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Now multiply (7) by xi , (8) by x z and (9) by 2 x 3 > and add to obtain 

xi X.i + X2 V.2 + 2x3 X.3 = - xi X-3 - ~r Xi + — X4 and 

x 3 x 3 

integrate both sides to obtain: 

f : 

xi X-l + X2 X.2 + 2X3X3 =j pi Xl + X2 X2 + 2X3X3-X1 X3 

- — 2 ~ Xi + — ~ X4I dt + C3 
x| x 3 J 

This simplifies (by using 13) to 

xi Xi + X2 X2 + 2x3X3 


( 16 ) 


(17) 


= /i Xidt -/ x 


o Xi dt - 1 X 5 m dt+ci t + C 3 

xf 


(18) 


The limits of integration throughout this development are from t 

to t . The values of all state variables at t_ are lumped together as 
c 

one arbitrary constant. 

Now solve xi X 3 in equation (15) and add this result to X 3 X 3 as 
found in equation ( 9 ) to obtain: 


• • • • A. 

x 3 + x 3 X 3 + m\s + m \5 = Ci - ^2 M 

which after integration with respect to t becomes 




x 3 \ 3 + m\$ = -| ^ Xi dt + ci t + C4 


(19) 


( 20 ) 


Substituting the result obtained ^ or l Xi dt in (Z 0 ) into equation (18) 
yields 

xi Xi + x 2 x 2 - 2 x 3 X 3 = 4m X 5 - 3ci t + c 5 |mX 5 dt (21) 

Now replace x 3 X 3 with the value given in equation (7); simplify and 
integrate to obtain 

f f C * • q ? 

x 3 Xi -1x3 X 3 dt = / 4 m X 5 dt - im X 5 dt - / 2 c it + c 5 t + c 6 

J J Jj 

( 22 ) 
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Integrate j x 3 \ 3 dt by parts to obtain using equation (ZO) 

■/« 


X 3 X 3 dt = X 3 X 3 t +1 


dt - — ~ — + m X .5 t -l(m\ 5 )dt (23) 

X3 2 


Now apply the mean value theorem for integrals to the integral ~ 


W rite 


ite / 

K 


kt \i dt 


x| 


as r 

kXi 


p kt \i dt 
x! 


X 3 


+ C 7 where the interval from t to 


t is picked so that — =r- does not change sign in the interval t t ] . Then 

P pj 


r 


c kt \\ dt 


Xo 




can be written as a 
t 

/ 

c 8 where c 8 = c 7 - a 
k \i dt 


4 


r 


p k \\ dt 


Xo 


+ c 7 = a 


1 


c k \i dt 


X 3 


k \i dt . 

1 and t < a < t . 


x 3 


—2 by its value in equation (ZO). The 

*3 


fkt \i 
n tt 

J x 3 


\i dt 


Now replace 


- - ax 3 \ 3 - 


am \ 5 + aci t + c 9 

Substituting this result in equation (Z3) and then in (ZZ) yields 
x 3 \i + (a-t) x 3 X 3 - dt -Jm \ 5 dt + m \ 5 (cio-t) 

- ci 1 t 2 + Cl z t + Cl 3 


(24) 


(25) 


Equations (15), (21) and (25) can be so arranged that the right side 

m- mo 


of the equations are functions of m , t, , and \ 5 . Since t = 
the right sides can be considered as functions of m^ X 5 \ 5 


m 

Now 


remember, in this study we are not attempting to solve these equations 
but to find a model with undetermined coefficients that will satisfy the 
equations. Thus to find such a model we now assume that the right 
side of each equation can be expressed as a power series in 


m 


F 

— to give 



k] 

XI X 2 , , 

Xi - X 2 + XI X 3 

X 2 

+ c 

00 

= S bi 

( £ ) 

\X 3 

x 3 J 

x 3 

X 3 

i — 0 

\ m | 


(26) 


00 

XI Xi + x 2 X 2 - 2 x 3 \ 3 = S c i 

i-o 


00 


F l 1 

— « x 3 Xi + x 3 (a-t )\ 3 = E di 

i = o 


FJ 
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The bi, Ci and di are unknown coefficients, which when they appear 
in the model, will need to be evaluated by some curve fitting technique 
that fits the model to the space of optimum trajectories. Now consid- 
ering these equations as three equations in unknowns \\ , \ z , \ 3 , we 

solve for \\ , and \ z . Since cot0 = ““^ the results will be expressed in 

M 

terms of cot 0; cot 0was selected rather than tan 0 to make the denom- 
inator as simple as possible. 




cot 0 =j(x 2 ~ ~ ) 
x 3 


/F) 1 , 

fa + 


m 


— I + XI x 3 2r i — 


f\ 1 


m 


Ifi 1 { 2 k 

. , — ! + Cl (X2 - — I t + C2 XI X 3 t + C 3 Xi X2 t+ C 4 XI X2 


+ x 3 Ssi 


+ c 5 X 2 x 3 / XI x 2 Sgi 


X 3 


F 

m 


IF 1 

+ x 2 x 3 2hi I — 


m 


2 2 

+ CfcXl x 2 t + C 7 X Z X 3 t + C 8 X 2 t + C 9 X 2 


( 27 ) 


In recent guidance polynomials the series have been truncated after 


second order terms involving — . If we assume that all terms involving 

2 


— of the rational function are dropped off after 
m 


— , equation ( 27 ) 

m 1 


becomes 

cot 0 = 


2 2 k 7 

a i XI + a 3 xi x 2 + a 4 x 2 x 3 + a 5 xi x 3 + a 6 (x 2 - — )+ a 2 x 3 

x 3 

F 

xi 

x 3 


2 k 

+ a 7 xi x 2 t + a 8 xi x 3 t + a 9 (x 2 - — ) t + ai 0 


F \ 

+ ai 1 xi x 3 ]— j + ai 2 x 3 


m 


— + ai 3 (x 2 - — ) — + ai 4x1 


m 


+ ai 5 xi x 3 


/F'l . 2 

— + ax 6 x 3 


m 



F 

m 


ai 8 xi x 2 + ai 9 x 2 x 3 + a 2 o x 2 + a 2 1 xi x 2 t + a 2 2 x 2 x 3 t + 


a 2 3 x 2 t + a 2 4 xi x 2 


[F| 





f| 

i F 1 

^ /F | 


+ a 2 5 x 2 x 3 — 

+ a 2 6 xi x 2 — 

m J 

\ m j 

H 


( 28 ) 
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Of course the assumption that the series involving — can be trun- 
cated after second order terms may or may not be correct. The order 
of the terms at which all other terms would be insignificant would need 
to be determined by considering problems on an electronic computer. 

Thus the cot 0 is expressed in terms of a rational function of state 
variables involving 17 terms in the numerator and 10 terms in the de- 
nominator. To use this function as a guidance function would require 
that 29 constants be found by some curve fitting technique such that the 
function would give a good representation of the space of optimal tra- 
jectories. 

It is of interest to investigate what additional assumptions will need 

to be made in order to change our rational function to a polynomial. 

This can be accomplished in either of the following ways. 

In reference ( 1 ) Mr. Moyer considered the equations ( 4 ) without the 

condition that g4 = X4 - — = 0 . If we take the assumption of Mr. Moyer 

x 3 

we will be able to obtain a polynomial. 

If we are not able to make the above assumption then we can still 

obtain a polynomial by certain assumptions relative to equation ( 15 ). 

In this equation group the X4 ^4 (which can be written as c — on the 

x 3 

right side of the equation to obtain: 

X2 , k X1X 2 , , „ CX 2 , . x 

\l - 1 \i - a.2 + XI X3 = Cl - + m\ 5 + m\ 5 

X3 X3 X3 X 3 

C ? X? • • 

Now assume that ci + m\ 5 + m\ 5 can be written as a power 

x 3 
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JJ f 

series involving — Under this assumption •, then equation (27) may 


be simplified in the following manner. Multiply the numerator and de- 
F 

nominator by — ; divide by xz X 3 ; and then divide the numerator and 
m 

l f i 1 

denominator by the power series Shi — ! to obtain: 


\m 1 


cote = 




( 4> 2 -- 3 ) 

S ai 

l£f + 

|r| 

Sbi 

fF 

1 

+ £ ' 

S C 1 

[£l 

1 

* + «* 

[— ri 

r 5 

H* 

II 

O 

'< m • 

ki_ 

i=i ' 

Ej 

U-V, lr ' 

0 

II 

•H 

mi 

II 

•H 

mj-J 


1 +i-ir=v 


m 


(29) 


or 


cot 0 = 




f (i 2 

-K) 

Sfi 

F 1 

1 r 

+ - 

2gi 

F 

i 

+ 

Shi 

F 1 

1 r 

+ ” 

Sji 

F Y 


r 

i = 0 

mi 

r 

i=l 

[rn\ 


i=0 

m 

r 

i=l 

m 


1 +- Ski 


f r 

m 


(30) 


These two expressions for cot 0 are equivalent. The terms in the 


r 2 e * 

denominators are reciprocals. Now if — 1 

r i = 0 


— J converges to a 


number in absolute value less than 1, the denominator can be expanded 


one then T Ski 
r 


m 


F 1 


m 


is greater than 


as a negative binomial in the numerator. If — S eA 

r 1 

F \ 1 

I is less than one and the second denominator can 


be expanded as a negative binomial in the numerator. Thus we have 

\ 

r S e * / f 1 * 

two polynomials representing cot ©depending on whether — 1 1 — 1 

r i= 0 \mj 

r ^ / p | 

is less than or greater than one. Since in general the size of— e M — 

r i= 0 \mj 

is not known, we write down all the terms contained in both polynomials 
Notice that the model obtained involves r, r and 4> and could possibly 
reduce the number of terms needed in a polynomial guidance function. 
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Daniel E. Dupree, James O'Neil, Edward Anders 


SUMMARY 

From a given vector derived previously, an ideal function is developed 
which satisfies a specified least squares error tolerance. 


THE FUNCTION 

In the vector cp^ +1 = (X Q , X^, ... , X n ), computed in [ 3 ] , suppose we 
let \ i he the value of some ideal function cp^ + ^(p) at p^; i.e., cf^ + -^(P^) = 
X^. Then this ideal function assures us that the error E, where 

E = - W% + l (P i> J ’ 

is less than the imposed tolerance 5. Since we know the values of this 
ideal function at the tabular values p_^, our next objective is to develop 
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a technique for computing cp N+1 ( P ' ) > for some value p* £ i = 0,1,..., n, 

N+l 

such that the error obtained by using £ A.cp.(p') to approximate X(p'), 

5=0 J J 

in the sense of least squares, is as small, if not smaller, than the error 

N 

obtained by approximating X(p ' ) with £ A cp.(P')* We obtain this value 

j=0 J J 


(p 1 ) in the following manner. 

First, we compute ^^(k), k = -1, 0, 1, ... , N, e N+1 and A^ +1 as 
follows : 




N 


%+l 


‘ ^ ^%+l’ S j 


V’ l)( %fl' C? 0 ) 

* 

N-l 

VkL^ = A N+1^' 1) V -1 ^%+1' ^N+l^ V j) * 

N 

e N+l = %+l “ ,1 Vl-1^ S j 

^N+l = e N+l‘ ) * 


Finally, compute the (N+2) A. t s, j =0, 1, • , N+l, as follows: 

\+l = ^N+l ^N+l^ 
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*], - V -1 ) [/s ■ • # N+1 Vt-l® 1 

\-l - W' 1 ' W-l - ^ V"- 15 + [- A N + 1 (N ' 1) + ] }> 


Now let p. . be a p. such that || p , - P’ || = min { || P, - P' 

1 1 1 0 < i < n 


and let us define the following function: 


£,Vj (p,) + A N+l M(p,) ^ 

L(P ±t ) " 2 || p - P' 

where M(p') = lIpTH 


for 2 || p lt - P' || < L(p ±l ), 


= 0, otherwise. 


where L(P i , ) = 


0 < i < n 


II Pi - Pi« II } 


Thus, when p » is chosen, we are able to use the function above to 
approximate X(p' ) , being assured that the approximation obtained here is 

N 

no worse than the value £ A .cp. ( P ' ) obtained by using the initial least 


squares approximating function. 


Writing this multiple of X^ as 


iiOi.) - II e t , - P' II 
I L (Pi- > 

we see that we have a factor which varies from zero to one as p' varies 
from a position on the boundary to a position at the center of the ball 

{p | II Pf - p' II < l L <Pi.> } • 

Thus, the factor X ± , which was derived in association with the vector P it 
is weighted depending on the nearness of P' to . 

For a particular p 1 , we may have a possibility of multiple choices 
for p . , . Perhaps, more than one of these would satisfy 

2 || P^ - P' II < L(P.,). 


This situation depends on the configuration of the P^'s and on the 
orientation of p' with the p^s near it in the norm sense. 

Suppose there are m choices of P^ f , an ^- r them satisfy 


2 1| p., - P' || < L(P.,). Let 0 , 0 ± j, ... , P i; , denote these Pi.' 8 * 

and M( p ' ) = 


Let H. , 
t 


X., . H 
H t 

o o 


max 

1 < t < r 


L(p.,) - 2 || p. f - P’ || 

"t 


L(P.,) 

x t 


Thus we orient p« with the p ± which exerts the most influence 


on P'. 
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